Outline:
recursively look for all .jpg files and store their paths somewhere (probably a list, Python's version of an array)
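
A rough sketch of the file search, assuming the standard-library pathlib module is acceptable here. The function name find_jpgs and the choice to match the extension case-insensitively (so .JPG files are picked up too) are my own assumptions, not part of the outline.

```python
from pathlib import Path


def find_jpgs(root):
    """Return every .jpg file under root (searched recursively), sorted by path.

    find_jpgs is a hypothetical helper name. The extension check ignores
    case, which is an assumption about what the scans are called.
    """
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() == ".jpg"
    )
```

Calling find_jpgs(".") would give a plain Python list of Path objects, which can then be looped over directly.
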
start while loop (might make this a for loop per: https://www.geeksforgeeks.org/how-to-iterate-over-files-in-directory-using-python/)
store current image (path probably) in a variable
use opencv to grab bottom right corner of image (https://stackoverflow.com/a/58984370)
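
A possible way to do the crop, as a sketch only. Images loaded with cv2.imread are NumPy arrays indexed as [row, column], so the bottom-right corner is just a slice. The helper names and the default fraction of 0.2 are assumptions; the right fraction depends on where the page number sits in the scans.

```python
def bottom_right_bounds(height, width, fraction=0.2):
    """Return (top, left) of a bottom-right crop covering `fraction` of each side.

    The 0.2 default is a guess, not a value taken from the outline.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    top = height - max(1, int(height * fraction))
    left = width - max(1, int(width * fraction))
    return top, left


def crop_bottom_right(image, fraction=0.2):
    """Crop an image array (e.g. from cv2.imread) to its bottom-right corner."""
    height, width = image.shape[:2]
    top, left = bottom_right_bounds(height, width, fraction)
    # Rows from `top` to the bottom edge, columns from `left` to the right edge.
    return image[top:, left:]
```

With OpenCV installed, usage would look something like crop_bottom_right(cv2.imread(str(path))), where path is the current image path.
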
convert the output to greyscale then binary
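
One way this conversion could look with OpenCV, offered as a sketch. The outline does not say how to pick the threshold, so using Otsu's method here is an assumption, as is the helper name to_binary. It needs the opencv-python package.

```python
def to_binary(image_bgr):
    """Convert a BGR image array to greyscale, then to pure black and white."""
    # Imported inside the function so the file still loads without OpenCV.
    import cv2

    grey = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2GRAY)
    # With THRESH_OTSU the threshold is chosen automatically, so the 0
    # passed as the threshold argument is ignored (Otsu is an assumption).
    _, binary = cv2.threshold(
        grey, 0, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU
    )
    return binary
```

If the scans have uneven lighting, a fixed threshold or cv2.adaptiveThreshold might work better than Otsu; that would need testing on real pages.
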
use pytesseract image_to_string on output to find page number (saving this to a variable inside the loop)
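
A sketch of the OCR step. pytesseract.image_to_string returns raw text that usually carries whitespace or stray characters, so the digits are pulled out separately. The helper names, the "--psm 7" setting (treat the image as a single line of text) and taking the first run of digits are all assumptions. It needs the pytesseract package plus a Tesseract install.

```python
import re


def parse_page_number(ocr_text):
    """Return the first run of digits in OCR output as an int, or None."""
    match = re.search(r"\d+", ocr_text)
    return int(match.group()) if match else None


def read_page_number(binary_image):
    """Run Tesseract on the cropped binary image and parse the page number."""
    # Imported inside the function so the file still loads without pytesseract.
    import pytesseract

    # "--psm 7" is an assumed setting, not something the outline specifies.
    text = pytesseract.image_to_string(binary_image, config="--psm 7")
    return parse_page_number(text)
```

Returning None when no digits are found leaves room to skip or flag pages where the OCR fails, instead of renaming them to something wrong.
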
rename the input file to the page number variable
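
A sketch of the rename step using pathlib. The helper name rename_to_page_number is hypothetical, and refusing to overwrite an existing file is an assumption added for safety, since two pages could be OCR'd to the same number.

```python
from pathlib import Path


def rename_to_page_number(image_path, page_number):
    """Rename image_path to '<page_number><original extension>' in place.

    Returns the new path. Raises FileExistsError instead of overwriting
    a different file that already has the target name.
    """
    image_path = Path(image_path)
    target = image_path.with_name(f"{page_number}{image_path.suffix}")
    if target == image_path:
        return target  # already named after its page number
    if target.exists():
        raise FileExistsError(f"{target} already exists")
    image_path.rename(target)
    return target
```

For example, rename_to_page_number("scans/IMG_0001.jpg", 42) would rename that file to scans/42.jpg, provided the source file exists and no 42.jpg is already there.
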
move on to the next file in the list/array of .jpg files (https://www.geeksforgeeks.org/how-to-iterate-over-files-in-directory-using-python/)
end loop