问题
The Google Pixel 2 and probably other phones since have the capability to cover "Motion Photos". These are saved as MVIMG and comparatively big.
I’m looking for a way to remove/extract the video.
So far I found a promising exif tag
$ exiftool -xmp:all MVIMG_123.jpg
XMP Toolkit : Adobe XMP Core 5.1.0-jc003
Micro Video : 1
Micro Video Version : 1
Micro Video Offset : 4032524
I thought the video might be present at the specified offset, but this doesn’t work:
$ dd if=MVIMG_123.jpg of=video.mp4 bs=4032524 skip=1
$ file video.mp4
video.mp4: data
Are there any resources that document the embedding? Are there even any tools to remove/extract the video?
回答1:
I did find https://github.com/cliveontoast/GoMoPho which scans for the mp4 header and then dumps the video.
We can do the same, scanning for ftypmp4
from the MP4 header (actual file starts 4 bytes earlier):
Thus to extract videos:
for i in MVIMG*.jpg; do \
ofs=$(grep -F --byte-offset --only-matching --text ftypmp4 "$i"); \
ofs=${ofs%:*}; \
[[ $ofs ]] && dd "if=$i" "of=${i%.jpg}.mp4" bs=$((ofs-4)) skip=1; \
done
And to remove videos:
for i in MVIMG*.jpg; do \
ofs=$(grep -F --byte-offset --only-matching --text ftypmp4 "$i"); \
ofs=${ofs%:*}; \
[[ $ofs ]] && truncate -s $((ofs-4)) "$i"; \
done
回答2:
The EXIF tag is useful, but the offset is with the respect to the end of the file. The mp4 file is embedded at:
[file_size-micro_video_offset, file_size)
For example:
$ exiftool -xmp:all MVIMG_123.jpg
XMP Toolkit : Adobe XMP Core 5.1.0-jc003
Micro Video : 1
Micro Video Version : 1
Micro Video Offset : 2107172
Micro Video Presentation Timestamp Us: 966280
$ python -c 'import os; print os.path.getsize("MVIMG_123.jpg") - 2107172'
3322791
$ dd if=MVIMG_123.jpg of=video.mp4 bs=3322791 skip=1
$ file video.mp4
video.mp4: ISO Media, MP4 v2 [ISO 14496-14]
回答3:
The above suggestion using grep -F --byte-offset ...
and dd
does not work for me on macOS High Sierra as /usr/bin/grep
outputs a wrong offset — I guess it yields the offset of the "line" which contains the word ftypmp4
, i.e. the position of the previous LF character plus one. I might guess wrong, but anyway, this is my solution:
for i in MVIMG*.jpg; do \
perl -0777 -ne 's/^.*(....ftypmp4.*)$/$1/s && print' "$i" >"${i%.jpg}.mp4"; \
done
This uses the ability of perl
to slurp in a whole file at once and treat it as one big string. If no ftypmp4
with at least four leading bytes is present, an empty file is created, if multiple are present, the last one is extracted.
Similarly, to remove the video from all the files:
for i in MVIMG*.jpg; do \
perl -0777 -pi -e 's/^(.*?)....ftypmp4.*$/$1/s' "$i"; \
done
This uses the in-place editing feature of perl
. Everything after the first occurrence of ftypmp4
with its four leading bytes is cut off. If there are no occurrences, the file is rewritten with its contents unchanged.
(One might or might not need to set PERLIO=raw in the environment and/or unset the locale related variables to avoid UTF-8 interpretation, which could fail for binary files which happen to include byte sequences that violate the UTF-8 composition rules. In my tests with various MVIMG files no such problems occurred though.)
来源:https://stackoverflow.com/questions/53104989/how-to-extract-the-photo-video-component-of-a-mvimg