From 5e505560cb6df27a318726242ed3f7e602217145 Mon Sep 17 00:00:00 2001 From: Kabooshki Date: Mon, 29 Apr 2024 12:52:00 -0500 Subject: [PATCH] added TODO --- TODO | 20 ++++++++++++++++++++ 1 file changed, 20 insertions(+) create mode 100644 TODO diff --git a/TODO b/TODO new file mode 100644 index 0000000..e8409f3 --- /dev/null +++ b/TODO @@ -0,0 +1,20 @@ +Outline: + +recursively look for all .jpg files and store them somewhere (maybe in an array?) + +start while loop (might make this a for loop per: https://www.geeksforgeeks.org/how-to-iterate-over-files-in-directory-using-python/) + +store current image (path probably) in a variable + +use opencv to grab bottom right corner of image (https://stackoverflow.com/a/58984370) + +convert the output to greyscale then binary + +use pytesseract image-to-string on output to find page number (saving this to a variable inside the loop) + +rename the input file to the page number variable + +iterate through the list/array of .jpg files (https://www.geeksforgeeks.org/how-to-iterate-over-files-in-directory-using-python/) + +end loop +