Skip to content

Commit 55438ae

Browse files
added frame count section (#1745)
1 parent dd7cd11 commit 55438ae

File tree

1 file changed

+12
-1
lines changed

1 file changed

+12
-1
lines changed

docs/en/ocr_benchmark.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -96,7 +96,18 @@ Note: don't forget to count multi-frame images.
9696
If you don't know the dimensions, but you know that your images are also uncompressed, you can use file size as proxy.
9797
_Keep in mind that if you actually have compressed images you will be underestimating the processing time_.
9898

99-
99+
#### Why the numbers are related to uncompressed images
100+
The variation of the effective compression levels in real datasets makes it impractical to use such a metric in an estimation. Some datasets with low density of information can get compression levels up to 30X, while some others can have lower compressions, so picking a reference dataset is very difficult for the general case. </br>
101+
But not all hope is lost, you can work with frames!.
102+
103+
104+
#### Using total frame count as a proxy for estimation
105+
If you don't know the size of the images, or the compression, you can estimate the number of frames and use the time it takes for a single frame(like the numbers we've shared in this document) as a proxy.
106+
</br>
107+
Reasons for doing this:
108+
* You can compute frame count and sizes very efficiently using Visual NLP.
109+
* Even if not all frames are of the same dimensions, you can resize them prior to feeding them to the ML models in the pipeline.
110+
* This way, each frame will have a fixed size processing time.
100111

101112

102113
</div>

0 commit comments

Comments
 (0)