Outline:
recursively look for all .jpg files and store their paths in a list
start loop over the .jpg files (probably a for loop rather than a while loop, per: https://www.geeksforgeeks.org/how-to-iterate-over-files-in-directory-using-python/)
store the current image's path in a variable
use opencv to grab bottom right corner of image (https://stackoverflow.com/a/58984370)
convert the cropped corner to greyscale, then threshold it to binary
use pytesseract image_to_string on the binary image to find the page number (saving this to a variable inside the loop)
rename the input file to the page number, keeping the .jpg extension
move on to the next file in the list/array of .jpg files (https://www.geeksforgeeks.org/how-to-iterate-over-files-in-directory-using-python/)
end loop
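
A rough sketch of how the outline above might look in Python. This is one possible reading, not a finished script. Assumptions not in the outline: the helper names (find_jpgs, read_corner_text, parse_page_number, rename_by_page_number) are made up for illustration, the corner is taken as the bottom-right 15% of the image, Otsu thresholding is used for the binary step, Tesseract is run with --psm 7 (single line of text), and files whose page number can't be read or would collide with an existing file are skipped.

```python
import re
from pathlib import Path


def find_jpgs(root):
    """Recursively collect every .jpg under root (case-insensitive), sorted."""
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() == ".jpg"
    )


def read_corner_text(image_path, frac=0.15):
    """OCR the bottom-right corner of an image and return the raw text.

    frac (assumed value) is the share of width/height treated as the corner.
    """
    # Imported here so the file-handling helpers work without OpenCV/Tesseract.
    import cv2
    import pytesseract

    image = cv2.imread(str(image_path))
    if image is None:
        raise ValueError(f"could not read image: {image_path}")
    height, width = image.shape[:2]
    # Slice rows from the bottom and columns from the right.
    corner = image[int(height * (1 - frac)):, int(width * (1 - frac)):]
    grey = cv2.cvtColor(corner, cv2.COLOR_BGR2GRAY)
    # Otsu picks the threshold automatically (an assumption, not in the outline).
    _, binary = cv2.threshold(grey, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return pytesseract.image_to_string(binary, config="--psm 7")


def parse_page_number(text):
    """Return the first run of digits in the OCR text, or None if there is none."""
    match = re.search(r"\d+", text)
    return match.group(0) if match else None


def rename_by_page_number(root, read_text=read_corner_text):
    """Rename each .jpg under root to '<page number>.jpg'; return {old: new}."""
    renamed = {}
    # The list is built before any renaming, so renamed files are not revisited.
    for path in find_jpgs(root):
        page = parse_page_number(read_text(path))
        if page is None:
            continue  # OCR found no digits; leave the file alone
        target = path.with_name(f"{page}{path.suffix}")
        if target == path or target.exists():
            continue  # never overwrite an existing file
        path.rename(target)
        renamed[path] = target
    return renamed
```

Calling rename_by_page_number("scans") would process everything under a scans folder (the folder name is just an example). The read_text parameter is there so the renaming logic can be tried with a stand-in function before OCR is set up.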