Ocr does not start from 0 #6

Alexisback · 2025-02-27T17:07:57Z

Congratulations on the program, it is really well done and it works.
There are a few things I would like to report.

OCR does not start from 0 but after about 5 seconds, so some subs at the beginning are left out.
I would also like the subs to be generated like the originals: if the original has line breaks, the program should reproduce them.

That's all I have for now. I'm testing the program (the desktop version, VSE-windows-cpu) and if I have other questions I will post them here.
Thanks

PS: Some guidance for the Preferences menu would be helpful

voun7 · 2025-02-27T20:26:40Z

I have not experienced this. Does this happen with just one video?
I'm not sure what you mean by line changes, but srt doesn't allow for much customization of text.

Which part of the preference menu do you need help with?

Alexisback · 2025-02-28T05:03:40Z

Hi voun7

It does this with all videos.
I'm referring to the time cursor: 00.00.00.000 does not correspond exactly to the beginning.
When I open the video, even though it says 00.00.00.000, it is actually 5 seconds ahead. Moving back with <--- on the keyboard, the video should stop at the beginning, but instead it wraps around and continues from the end.
Maybe it's a bug in the cursor.

voun7 · 2025-02-28T06:25:11Z

Sorry about that. I have been unable to reproduce this, and I have not experienced it before. The video in the project's test folder also doesn't show this problem. Could you make an image similar to the one above using that video?

Alexisback · 2025-02-28T06:57:51Z

Try this one. You will see that OCR starts after 5 seconds, exactly at "Sceneggiatura", but there is more text before that.
https://odysee.com/@cinemarusso:c/Winnie-Puh:7

You can download it from the link above.

Alexisback · 2025-02-28T14:36:15Z

I don't see the other OCR languages

voun7 · 2025-02-28T15:43:43Z

Use the latest version to see all supported languages.

Something is definitely wrong with the encoding of the Winnie Puh video or the way the site encodes downloads. I had to download it 3 times before I could get a copy of the video that wasn't frozen towards the end.

The video is processed by OpenCV in a strange way. The first frame OpenCV sees is not the first frame of the video when played with a media player.

import cv2 as cv

file = "Winnie Puh-sub.mp4"
capture = cv.VideoCapture(file)

capture.set(cv.CAP_PROP_POS_FRAMES, 0)  # 0 should represent the first frame of the video

ok, image = capture.read()  # ok is False if no frame could be decoded

if ok:
    cv.imshow(file, image)  # but the frame shown isn't the first frame
    cv.waitKey()
    cv.destroyAllWindows()
capture.release()
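
A rough way to measure how far into the video OpenCV actually starts is to log the position it reports after each of the first few reads. This is only a diagnostic sketch, not a fix: the helper names `first_frame_timestamps` and `probe_file` are made up for illustration, and what `CAP_PROP_POS_MSEC` reports can vary with the backend and the container, so the numbers should be treated as a hint.

```python
def first_frame_timestamps(capture, pos_msec_prop, count=5):
    """Read up to `count` frames and collect the position (in ms)
    that the capture reports after each successful read."""
    positions = []
    for _ in range(count):
        ok, _frame = capture.read()
        if not ok:  # no more frames, or decoding failed
            break
        positions.append(capture.get(pos_msec_prop))
    return positions


def probe_file(path, count=5):
    """Hypothetical wrapper: open `path` with OpenCV and probe its start."""
    import cv2 as cv  # imported here so the helper above works without OpenCV

    capture = cv.VideoCapture(path)
    try:
        if not capture.isOpened():
            raise IOError(f"could not open {path!r}")
        return first_frame_timestamps(capture, cv.CAP_PROP_POS_MSEC, count)
    finally:
        capture.release()
```

Calling `probe_file("Winnie Puh-sub.mp4")` and comparing the first value with what a media player shows at the start would indicate whether the offset comes from the decoding step or from somewhere later in the program.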

For now, I'm unable to fix this. Try videos from other sites and see if the same thing happens.

As for the line changes you mentioned previously, after seeing this video I think I understand what you mean. You want the extracted subtitles to keep the line breaks of the original text instead of being joined on a single line. I should be able to do something about that.
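
One possible way to keep the line breaks is to group the detected text boxes by their vertical position and join each group on its own line. The sketch below assumes the OCR step yields `(text, x, y)` tuples with the top-left corner in pixels; that format, the function name `group_into_lines`, and the `tolerance` value are assumptions for illustration, not the program's actual data structures.

```python
def group_into_lines(boxes, tolerance=10):
    """Join OCR boxes into text, starting a new line whenever a box sits
    more than `tolerance` pixels below the first box of the current line.

    boxes: iterable of (text, x, y) tuples, x and y in pixels.
    """
    lines = []  # each entry: [reference_y, [(x, text), ...]]
    for text, x, y in sorted(boxes, key=lambda box: box[2]):
        if lines and abs(y - lines[-1][0]) <= tolerance:
            lines[-1][1].append((x, text))  # same row as the current line
        else:
            lines.append([y, [(x, text)]])  # start a new line
    # Within each line, order the pieces left to right before joining.
    return "\n".join(
        " ".join(text for _, text in sorted(words)) for _, words in lines
    )


boxes = [
    ("world", 120, 52),
    ("Hello", 10, 50),
    ("Second", 10, 90),
    ("line", 100, 93),
]
print(group_into_lines(boxes))
```

Since srt stores multi-line subtitles as consecutive text lines within one cue, the joined string could be written into the cue as is.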

Alexisback · 2025-02-28T17:29:52Z

I downloaded the latest version and I see all the OCR languages now.
If you have problems downloading from the browser, try yt-dlp (it is on GitHub):

yt-dlp.exe https://odysee.com/@cinemarusso:c/Winnie-Puh:7

Exactly, you understood correctly: the generated subs are all on the same line, and it would be convenient to split them across one or more new lines like the original.

Before your software I used VideoSubFinder, a great program, but it requires an external OCR and many steps before you get the result.
It lets you load the video with either OpenCV or FFmpeg.
Maybe the problem is OpenCV?

https://www.videohelp.com/software/VideoSubFinder

Anyway, even with that program, opening the video with FFmpeg, the OCR still starts a little late: about 2 seconds, against the 5-8 seconds of Video_Sub_Extractor.

I'll try with other videos and let you know.

Thanks
