torchcodec-xpu: add XPU encoding support by eromomon · Pull Request #58 · intel/torchlib-xpu

eromomon · 2026-05-25T23:07:28Z

Extends VideoEncoder to support Intel XPU devices.

Encoder.cpp: extended kStableCUDA device-type checks to also match kStableXPU in both VideoEncoder and MultiStreamEncoder, enabling the hardware encoding path (hw frame context setup, pixel format selection, device registration).
XpuDeviceInterface: implemented setupHardwareFrameContextForEncoding and convertTensorToAVFrameForEncoding. RGB→NV12 conversion is done via a SYCL kernel, or via libswscale as CPU fallback.

Signed-off-by: Edgar Romo Montiel <edgar.romo.montiel@intel.com>

dvrogozh · 2026-05-26T22:53:30Z

+// ============================================================
+// Encoding: encodeConvert_SYCL
+// ============================================================
+UniqueAVFrame XpuDeviceInterface::encodeConvert_SYCL(


Suggested change

UniqueAVFrame XpuDeviceInterface::encodeConvert_SYCL(

UniqueAVFrame XpuDeviceInterface::convertTensorToAVFrameForEncoding_SYCL(

dvrogozh · 2026-05-26T23:02:59Z

+  UniqueAVFrame vaFrame(av_frame_alloc());
+  TORCH_CHECK(vaFrame != nullptr, "Failed to allocate AVFrame for encoding");
+  vaFrame->format = AV_PIX_FMT_VAAPI;
+  vaFrame->height = static_cast<int>(tensor.sizes()[1]);
+  vaFrame->width  = static_cast<int>(tensor.sizes()[2]);
+  vaFrame->pts    = frameIndex;
+
+  // Allocate a VAAPI surface from the hw_frames_ctx pool created in
+  // setupHardwareFrameContextForEncoding.
+  int ret = av_hwframe_get_buffer(codecContext->hw_frames_ctx, vaFrame.get(), 0);
+  TORCH_CHECK(
+      ret >= 0,
+      "av_hwframe_get_buffer failed: ",
+      getFFMPEGErrorStringFromErrorCode(ret));


Make a helper function out of this block:

UniqueAVFrame allocNV12Frame(int width, int height, int frameIndex) {...}

dvrogozh · 2026-05-26T23:13:33Z

+  if (xpu::use_sycl_color_conversion_kernel()) {
+    VLOG(9) << "[XPU Encoder] Encoding frame " << frameIndex
+            << " via SYCL on device=xpu:" << device_.index();
+    return encodeConvert_SYCL(tensor, codecContext, std::move(vaFrame));


Name functions as convertTensorToAVFrameForEncoding_SYCL and convertTensorToAVFrameForEncoding_CPU

Do NOT pass std::move(avFrame) just to return it from the function - that's bad pattern. Instead allocate frame in the function. That's why you need a helper allocNV12Frame() to avoid duplicated code. So the functions prototype should be the same as original convertTensorToAVFrameForEncoding().

You do not need WITH_SYCL_KERNELS here - you can handle all that inside the convertTensorToAVFrameForEncoding_SYCL(). I.e.:

UniqueAVFrame avFrame = convertTensorToAVFrameForEncoding_SYCL(); if (!avFrame) { avFrame = convertTensorToAVFrameForEncoding_CPU(); }

dvrogozh · 2026-05-26T23:15:35Z

+    const torch::stable::Tensor& tensor,
+    AVCodecContext* codecContext,
+    UniqueAVFrame vaFrame) {
+#ifdef WITH_SYCL_KERNELS


Do the same as in decoding path:

if (!xpu::use_sycl_color_conversion_kernel()) { return nullptr; } if (!has_fp64_) { return nullptr; } UniquAVFrame avFrame; #ifdef WITH_SYCL_KERNELS avFrame = allocNV12Frame(); .... #endif return avFrame;

dvrogozh · 2026-05-26T23:18:33Z

+  //  Layout A: 1 layer,  2 planes  — layers[0].planes[0]=Y, layers[0].planes[1]=UV
+  //  Layout B: 2 layers, 1 plane each — layers[0].planes[0]=Y, layers[1].planes[0]=UV
+  const bool layoutA = (desc.num_layers == 1 && desc.layers[0].num_planes == 2);
+  const bool layoutB = (desc.num_layers == 2 && desc.layers[0].num_planes == 1


well. Yes, except that we don't have any other driver which has another layout... I am not sure that we should implement something which we never tested.

dvrogozh · 2026-05-26T23:20:31Z


  void registerHardwareDeviceWithCodec(AVCodecContext* codecContext) override;

+  // ---- Encoding overrides ----


If you added "Encoding overrides", then probably you need to add "Decoding overrides" as well.

How to reproduce FFmpeg RGB->YUV matrix values 1. Expose ff_fill_rgb2yuv_table in libavfilter/libavfilter.v: add "ff_fill_rgb2yuv_table;" under the global section. Example: libavfilter/libavfilter.v LIBAVFILTER_MAJOR { global: avfilter_*; av_*; + ff_fill_rgb2yuv_table; local: *; }; 2. Rebuild FFmpeg: cd ffmpeg && ./configure && make -j$(nproc) && make install nm -D <prefix>/lib/libavfilter.so | grep ff_fill_rgb2yuv_table 3. Create rgb2yuv_test.c calling ff_fill_rgb2yuv_table(av_csp_luma_coeffs_from_avcsp(cs), m) for AVCOL_SPC_BT709, BT470BG. 4. Build: gcc rgb2yuv_test.c -o rgb2yuv_test \ -I<prefix>/include -L<prefix>/lib \ -lavfilter -lavutil -Wl,-rpath,<prefix>/lib Signed-off-by: Edgar Romo Montiel <edgar.romo.montiel@intel.com>

How to reproduce FFmpeg RGB->YUV matrix values 1. Expose ff_fill_rgb2yuv_table in libavfilter/libavfilter.v: add "ff_fill_rgb2yuv_table;" under the global section. 2. Rebuild FFmpeg: cd ffmpeg && ./configure && make -j$(nproc) && make install nm -D <prefix>/lib/libavfilter.so | grep ff_fill_rgb2yuv_table 3. Create rgb2yuv_test.c calling ff_fill_rgb2yuv_table(av_csp_luma_coeffs_from_avcsp(cs), m) for AVCOL_SPC_BT709, BT470BG. 4. Build: gcc rgb2yuv_test.c -o rgb2yuv_test \ -I<prefix>/include -L<prefix>/lib \

eromomon added 2 commits May 25, 2026 15:34

Add XPU encoding support to Encoder

f317814

Signed-off-by: Edgar Romo Montiel <edgar.romo.montiel@intel.com>

Update patch and constant for context sw_pixel conversion in XPU case.

ad842da

Signed-off-by: Edgar Romo Montiel <edgar.romo.montiel@intel.com>

eromomon requested review from dvrogozh and luis-real as code owners May 25, 2026 23:07

eromomon requested review from dvrogozh and removed request for dvrogozh May 25, 2026 23:07

dvrogozh changed the title ~~Add XPU encoding support to Encoder~~ torchcodec-xpu: add XPU encoding support May 26, 2026

dvrogozh requested changes May 26, 2026

View reviewed changes

eromomon added 2 commits May 28, 2026 08:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

torchcodec-xpu: add XPU encoding support#58

torchcodec-xpu: add XPU encoding support#58
eromomon wants to merge 4 commits into
intel:mainfrom
eromomon:eromomon/encoding

eromomon commented May 25, 2026

Uh oh!

dvrogozh May 26, 2026

Uh oh!

dvrogozh May 26, 2026

Uh oh!

dvrogozh May 26, 2026

Uh oh!

dvrogozh May 26, 2026

Uh oh!

dvrogozh May 26, 2026

Uh oh!

dvrogozh May 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	UniqueAVFrame XpuDeviceInterface::encodeConvert_SYCL(
	UniqueAVFrame XpuDeviceInterface::convertTensorToAVFrameForEncoding_SYCL(


		void registerHardwareDeviceWithCodec(AVCodecContext* codecContext) override;

		// ---- Encoding overrides ----

Conversation

eromomon commented May 25, 2026

Uh oh!

dvrogozh May 26, 2026

Choose a reason for hiding this comment

Uh oh!

dvrogozh May 26, 2026

Choose a reason for hiding this comment

Uh oh!

dvrogozh May 26, 2026

Choose a reason for hiding this comment

Uh oh!

dvrogozh May 26, 2026

Choose a reason for hiding this comment

Uh oh!

dvrogozh May 26, 2026

Choose a reason for hiding this comment

Uh oh!

dvrogozh May 26, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants