feat(video_composition): support of composing video frames from multiple video segments #129
ekuleshov wants to merge 2 commits into
Conversation
hm21 left a comment
Thanks for working on this huge improvement. I left a few comments, but overall it looks good to me and will improve that package a lot.
What I didn’t have time for yet is testing it extensively, but in my very quick tests everything worked perfectly fine. Btw I also created a prerelease pro_video_editor: 2.0.0-beta.1 for it so other users can start testing it as well.
```swift
outputImage = rotatedImage.transformed(by: translation)
center = CGPoint(x: outputImage.extent.midX, y: outputImage.extent.midY)
}
// 7. Apply global effects (if any)
```
Regression: global crop is no longer applied.
The previous implementation contained a block here that consumed cropX/cropY/cropWidth/cropHeight and called outputImage.cropped(to:). It was removed in this PR, but applyCrop() still populates these fields. As a result, any render call that uses cropping only ends up with a modified finalRenderSize (black bars / squashed output) — the image content is never cropped. Please re-introduce a crop step inside startRequest.
Sorry, I missed that. Could you point me to an example or a test that uses that?
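For reference, a minimal sketch of what a re-introduced crop step inside `startRequest` could look like, assuming the `cropX`/`cropY`/`cropWidth`/`cropHeight` fields keep the names populated by `applyCrop()`; the Y flip is an assumption about the coordinate convention, since Core Image uses a bottom-left origin:

```swift
// Sketch only: re-apply the crop that applyCrop() computed.
if let cropWidth = cropWidth, let cropHeight = cropHeight {
    let cropRect = CGRect(
        x: cropX ?? 0,
        // Flip the top-left-based cropY into Core Image's bottom-left space.
        y: outputImage.extent.height - (cropY ?? 0) - cropHeight,
        width: cropWidth,
        height: cropHeight)
    outputImage = outputImage.cropped(to: cropRect)
}
```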
```kotlin
context: Context,
useHdr: Boolean,
private val effect: VideoCompositionTransformation
) : BaseGlShaderProgram(useHdr, /* texturePoolCapacity= */ 1) {
```
texturePoolCapacity = 1 disables pipelining for this effect. Was this intentional, or just copied from a sample? A comment would help explain the trade-off.
Mostly copied from some samples... I was fighting with transparency on overlapping videos, as well as some time-gaps between video overlays, and this is the only approach that I could get to work.
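If the value stays at 1, a comment along these lines would capture the trade-off (the wording is a suggestion based on the explanation above, not verified behavior):

```kotlin
) : BaseGlShaderProgram(
    useHdr,
    // texturePoolCapacity = 1 intentionally disables pipelining: with a larger
    // pool, overlapping segments showed transparency glitches and time gaps
    // between video overlays; a single pooled texture forces in-order draws.
    /* texturePoolCapacity= */ 1
) {
```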
```dart
this.offset,
this.size,
this.zIndex,
this.opacity,
```
Please add asserts for the new fields, e.g. opacity ∈ [0, 1] and size.width/height >= 0. Bad input here surfaces only as cryptic native errors.
Also: toMap() serializes offset/size as nested maps, while toAsyncMap() flattens them into x/y/width/height. That divergence is confusing — consider unifying or commenting why the platform channel uses a different schema.
If I'm not mistaken, I copied that structure from other models with similar attributes (either image overlays or effects). I can change both methods to use the flat schema (or both to nested maps); these values seem simple enough to keep flat and reduce some complexity.
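A sketch of both suggestions combined, assuming the current field names on `VideoSegment`; the flat schema shown here is one option, and nested maps would work equally well as long as `toMap()` and `toAsyncMap()` agree:

```dart
import 'dart:ui' show Offset, Size;

class VideoSegment {
  VideoSegment({this.offset, this.size, this.zIndex, this.opacity = 1.0})
      // Fail fast in Dart instead of surfacing cryptic native errors.
      : assert(opacity >= 0 && opacity <= 1, 'opacity must be in [0, 1]'),
        assert(size == null || (size.width >= 0 && size.height >= 0),
            'size must be non-negative');

  final Offset? offset;
  final Size? size;
  final int? zIndex;
  final double opacity;

  // One flat schema shared by toMap() and toAsyncMap(), so both sides of
  // the platform channel read the same keys.
  Map<String, dynamic> toMap() => {
        'x': offset?.dx,
        'y': offset?.dy,
        'width': size?.width,
        'height': size?.height,
        'zIndex': zIndex,
        'opacity': opacity,
      };
}
```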
```dart
'scaleX': scaleX,
'scaleY': scaleY,
'renderWidth': qualityConfig?.resolution?.width,
'renderHeight': qualityConfig?.resolution?.height,
```
Behaviour change worth flagging in the CHANGELOG.
Removing the qualityConfig → scaleX/scaleY derivation is cleaner, but users who previously relied on qualityConfig.resolution (without setting explicit scaleX/scaleY) will now get the resolution forwarded as renderWidth/renderHeight instead of an implicit scale. iOS turns that into intendedRenderSize-based scale correction; Android forwards it to VideoSequenceBuilder. The output may differ slightly from previous releases, so please call this out as a (potentially) breaking change.
Is there an example or a test that relied on that scaleX/scaleY derivation?
I find these too confusing and hard to deal with, even for the image overlays. It seems much simpler to be explicit: calculate and specify the sizes/offsets of individual image overlays or video segments based on the video metadata.
In my tests, the implicit scale didn't work for all cases, e.g. when the target video frame needs to be bigger.
When not specified, the first video segment's resolution is used.
```kotlin
// Clear the target framebuffer to transparent before drawing the segment.
// This ensures that segments that don't cover the full canvas don't show garbage.
GLES20.glClearColor(0f, 0f, 0f, 0f)
GLES20.glClear(GLES20.GL_COLOR_BUFFER_BIT)
```
glClear inside drawFrame is suspicious. If Media3 reuses the same FBO across multiple segment effects in the same frame, this clear will wipe out previously drawn segments and you'd only ever see the last one. Please verify by rendering a stack/grid composition and checking that all segments are visible. Clearing should typically be done once per output frame, not per segment effect.
I'll check and poke around glClear. This code was also there to make gaps between videos transparent, e.g. when a higher-zIndex video segment starts later than the segments behind it. Media3's default processing wasn't making that easy to deal with.
The new grid example has several overlapping video segments, then a transparent video on top of them, and then a transparent image overlay on top of them all. That seems to be working as expected (you may need to maximize the video player to see the transparency).
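One way to address the per-segment clear, sketched under the assumption that the segment effects can share a guard (if each segment is a separate shader program instance, the guard would need to live in state shared between them):

```kotlin
// Clear the shared FBO only once per output frame, not once per segment
// effect, so earlier segments aren't wiped before later ones draw.
private var lastClearedFrameUs = Long.MIN_VALUE

override fun drawFrame(inputTexId: Int, presentationTimeUs: Long) {
    if (presentationTimeUs != lastClearedFrameUs) {
        GLES20.glClearColor(0f, 0f, 0f, 0f)
        GLES20.glClear(GLES20.GL_COLOR_BUFFER_BIT)
        lastClearedFrameUs = presentationTimeUs
    }
    // ... draw this segment on top of whatever was rendered before ...
}
```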
```kotlin
y = (clipMap["y"] as? Number)?.toDouble(),
width = (clipMap["width"] as? Number)?.toDouble(),
height = (clipMap["height"] as? Number)?.toDouble(),
zIndex = (clipMap["zIndex"] as? Number)?.toInt(),
```
Android silently ignores zIndex.
The PR description mentions this is out of scope, but right now a user can pass zIndex and it has no effect on Android with no indication. Please at least Log.w(...) when clip.zIndex != null and document the limitation on the Dart VideoSegment.zIndex field ("Currently not supported on Android — image layers always render on top").
Right. This is probably a leftover from my failed attempts to make zIndex work for image overlays. I don't think this field is propagated from the Dart code, and it isn't used in the rendering pipeline.
I can remove it or add some TODOs to the Android implementation.
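Until then, a one-line warning at the parse site would make the limitation visible (the tag and message wording are suggestions):

```kotlin
zIndex = (clipMap["zIndex"] as? Number)?.toInt()?.also {
    // Surface the unsupported field instead of silently dropping it.
    Log.w("VideoComposition",
        "zIndex=$it is not supported on Android yet; image layers always render on top")
},
```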
```swift
var zIndex: Int {
    switch self {
    case .video(_, let clip, _): return clip.zIndex ?? 0
    case .imageLayer: return Int.max
```
Image-layer z-order is hard-coded to Int.max.
This forces every image layer to render above all video segments, regardless of segment zIndex. Now that segments expose zIndex, image layers should participate in the same ordering (or at least be configurable). Otherwise users cannot put a video segment on top of an image overlay.
Right. The iOS/macOS platform code actually has working zIndex support for image layers, and this switch basically disables it so that image overlays always render on top, matching behavior across platforms.
To enable it again we just have to propagate the value from Dart and replace .max here with the actual zIndex. Perhaps also add a TODO or a comment here?
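Once the value is propagated from Dart, the switch could look roughly like this (the `imageLayer` associated value and its `zIndex` field are assumptions about the surrounding code):

```swift
var zIndex: Int {
    switch self {
    case .video(_, let clip, _): return clip.zIndex ?? 0
    // Fall back to Int.max to preserve today's "always on top" behavior
    // when no explicit zIndex was provided for the image layer.
    case .imageLayer(let layer): return layer.zIndex ?? Int.max
    }
}
```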
Correct, I mentioned this in PR notes. It is working on Apple platforms, but not on Android, so currently disabled to work the same between platforms.
Integration tests for the new composition features are missing.
example/integration_test/video_merge_test.dart only covers the previous VideoSegment fields (startTime/endTime/volume). Please add coverage for the new functionality introduced here, ideally in a new file like example/integration_test/video_composition_test.dart. Suggested cases:
1. `segmentTime` (absolute placement)
   - Two segments with overlapping `segmentTime` → assert output duration and that both rendered without crash.
   - Single segment with `segmentTime: Duration(seconds: 5)` and no preceding segment → expect a black gap before it.
2. `offset` + `size` (PIP)
   - Background segment full-frame + second segment with `offset: Offset(20, 20)`, `size: Size(w/2, h/2)` → sample frame bytes inside vs. outside the PIP rectangle and assert they differ; output resolution must match the background.
3. `zIndex`
   - Two full-frame segments with different `zIndex` → assert the higher-zIndex segment is visible by sampling pixels.
   - On Android, expect the field to be tolerated (no crash); mark with `skip:` or a platform guard until Android support lands.
4. `opacity`
   - Segment with `opacity: 0.5` over a solid-color background → assert sampled pixels lie within the expected blended range.
5. `zIndex` ordering between video segments and image layers: only relevant once the hard-coded `Int.max` issue is fixed; can be a follow-up.
6. Regression tests for the issues raised in this review (once those fixes land):
   - Single-segment render with `cropWidth`/`cropHeight` → frame bytes match the expected crop region.
   - Single-segment render with `rotateTurns: 1` → resolution swap + top-left pixel sample.
   - Multi-segment + `rotateTurns: 1` combination.
7. Platform guards: skip iOS/macOS-only or Android-only cases where the feature isn't supported yet.
At minimum, (1)–(4) should land in this PR as smoke tests for the new fields; (5) and (6) can be follow-ups.
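As a starting point, case (1) could look roughly like this (`renderComposition` and the asset helper are placeholders for illustration, not existing APIs in the package):

```dart
testWidgets('overlapping segmentTime renders without crash', (tester) async {
  final segments = [
    VideoSegment(video: await loadTestClip('a.mp4')),
    VideoSegment(
      video: await loadTestClip('b.mp4'),
      segmentTime: const Duration(seconds: 2),
    ),
  ];
  final output = await renderComposition(segments);
  expect(output, isNotNull);
  // Output duration should extend to the end of the later, offset segment.
});
```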
README needs to be updated for the new VideoSegment fields.
The VideoSegment API reference (around lines 526–550 of README.md on stable) currently only documents video, startTime, endTime. Please extend it with the new fields introduced here — volume, offset, size, zIndex, opacity, segmentTime — including the platform support matrix (e.g. note that zIndex is currently iOS/macOS-only) and ideally a short PIP / stack example mirroring the new _combinedPip demo.
Also worth a paragraph in the "Render" section about composing multiple overlapping segments, since that's the headline feature of this PR.
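A short PIP snippet for the README could look like this (field names follow this PR's `VideoSegment` changes; the render call is a placeholder):

```dart
final background = VideoSegment(video: mainClip);
final pip = VideoSegment(
  video: cameraClip,
  offset: const Offset(20, 20), // top-left corner of the PIP window
  size: const Size(320, 180),   // PIP rectangle in output pixels
  zIndex: 1,                    // currently iOS/macOS only
  opacity: 0.9,
);
// await renderComposition([background, pip]);
```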
Description
This is a first, somewhat rough cut of the video rendering pipeline update with support for composing video frames from multiple video segments. @hm21 please review.
The `VideoSegment` model is updated with a few new attributes.
Related Issue: discussed here
The Android implementation currently does sub-optimal audio re-encoding. I've hit a number of issues with Media3 encoders not liking leading or trailing blank/silent gaps and choking on audio encoding even when the track has no audio. So the Android code currently re-encodes audio when non-sequential segments are present.
I've had to put `zIndex` support for the image layers out of scope. I couldn't get it to work with the Media3 API on Android after a few tries, and apparently I don't really need it for my own use cases. So the image layers are applied at the end, on top of the composed video frames.
The current implementation should make it easy to add per-segment transformations similar to what is defined globally (flip, color mask, rotate, etc.), but I also left that for the future.
Type of Change