问题
I am trying to encode a .mp4 video from a set of frames using FFMPEG using the libx264 codec.
This is the command I am running:
/usr/local/bin/ffmpeg -r 24 -i frame_%05d.jpg -vcodec libx264 -y -an video.mp4
I sometimes get the following error:
[libx264 @ 0xa3b85a0] height not divisible by 2 (520x369)
After searching around a bit it seems that the issue has something to do with the scaling algorithm and can be fixed by adding a -vf argument.
However, in my case I don\'t want to do any scaling. Ideally, I want to keep the dimensions exactly the same as the frames. Any advice? Is there some sort of aspect ratio that h264 enforces?
回答1:
The answer to the original question which does not want to scale the video is:
-vf "pad=ceil(iw/2)*2:ceil(ih/2)*2"
Command:
ffmpeg -r 24 -i frame_%05d.jpg -vcodec libx264 -y -an video.mp4 -vf "pad=ceil(iw/2)*2:ceil(ih/2)*2"
Basically, .h264 needs even dimensions so this filter will:
- Divide the original height and width by 2
- Round it up to the nearest pixel
- Multiply it by 2 again, thus making it an even number
- Add black padding pixels up to this number
You can change the color of the padding by adding filter parameter :color=white
. See the documentation of pad.
回答2:
Just use -2
From the scale filter documentation:
If one of the values is
-n
withn > 1
, the scale filter will also use a value that maintains the aspect ratio of the input image, calculated from the other specified dimension. After that it will, however, make sure that the calculated dimension is divisible byn
and adjust the value if necessary.
Examples
Set width to 1280, and height will automatically be calculated to preserve the aspect ratio, and the height will be divisible by 2:
-vf scale=1280:-2
Same as above, but with a declared height instead; leaving width to be dealt with by the filter:
-vf scale=-2:720
"divisible by 2"
As required by x264, the "divisible by 2 for width and height" is needed for YUV 4:2:0 chroma subsampled outputs. 4:2:2 would need "divisible by 2 for width", and 4:4:4 does not have these restrictions. However, most non-FFmpeg based players can only properly decode 4:2:0, so that is why you often see ffmpeg
commands with the -pix_fmt yuv420p
option when outputting H.264 video.
Caveat
Unfortunately you can't use -2
for both width and height, but if you already specified one dimension then using -2
is a simple solution.
回答3:
If you want to set some output width and have output with the same ratio as original
scale=720:-1
and not to fall with this problem then you can use
scale="720:trunc(ow/a/2)*2"
(Just for people searching how to do that with scaling)
回答4:
It's likely due to the the fact that H264 video is usually converted from RGB to YUV space as 4:2:0 prior to applying compression (although the format conversion itself is a lossy compression algorithm resulting in 50% space savings).
YUV-420 starts with an RGB (Red Green Blue) picture and converts it into YUV (basically one intensity channel and two "hue" channels). The Hue channels are then subsampled by creating one hue sample for every 2X2 square of that hue.
If you have an odd number of RGB pixels either horizontally or vertically, you will have incomplete data for the last pixel column or row in the subsampled hue space of the YUV frame.
回答5:
The problem with the scale
solutions here is that they distort the source image/video which is almost never what you want.
Instead, I've found the best solution is to add a 1-pixel pad to the odd dimension. (By default, the pading is black and hard to notice.)
The problem with the other pad
solutions is that they do not generalize over arbitrary dimensions because they always pad.
This solution only adds a 1-pixel pad to height and/or width if they are odd:
-vf pad="width=ceil(iw/2)*2:height=ceil(ih/2)*2"
This is ideal because it always does the right thing even when no padding is necessary.
回答6:
LordNeckbeard has the right answer, very fast
-vf scale=1280:-2
For android, dont forget add
"-preset ultrafast" and|or "-threads n"
回答7:
You may also use bitand
function instead of trunc
:
bitand(x, 65534)
will do the same as trunc(x/2)*2
and it is more transparent in my opinion.
(Consider 65534 a magical number here ;) )
My task was to scale automatically a lot of video files to half resolution.
scale=-2,ih/2
lead to slightly blurred images
reason:
- input videos had their display aspect ratio (DAR) set
scale
scales the real frame dimensions- during preview the new videos' sizes have to be corrected using DAR which in case of quite low-resoution video (360x288, DAR 16:9) may lead to blurring
solution:
-vf "scale='bitand(oh*dar, 65534)':'bitand(ih/2, 65534)', setsar=1"
explanation:
- output_height = input_height / 2
- output_width = output_height * original_display_aspect_ratio
- both output_width and output_height are now rounded to nearest smaller number divisible by 2
setsar=1
means output_dimensions are now final, no aspect ratio correction should be applied
Someone might find this helpful.
来源:https://stackoverflow.com/questions/20847674/ffmpeg-libx264-height-not-divisible-by-2