Note: All time is in UTC
2026/02/16
-
17:01
- The project has been created and the base documentation files and structure have been created.
-
15:18
- The README file has been created and the Local LMs have been installed and tested for functionality.
-
21:20
- Created the
get_state.pyfile to obtain the state of the webpage via the URL.
- Created the
-
21:50
- Added a
robots.txtchecker and a scroll function to avoid "lazy-loading" to get the most out of the data.
- Added a
-
04:52
- Created the
processor.pyfile for the Visual Processing module.
- Created the
-
06:53
- The
processor.pymodule now processes possible candidates for buttons in the image provided.
- The
-
20:03
- Created the skeleton of the
classifier.pymodule. - Updated the comments and documentation.
- Created the skeleton of the
-
20:58
- Finished the base of the
cassifier.pymodule and now it can function. - Now creating a
scout.pyfile to orchestrate the pipelines.
- Finished the base of the
-
21:34
- It works as intended and can now read the text on the screen.
2026/02/18
-
17:04
- Staring an AI code audit and found the following on
scout.py:- Keep browser open for agentic actions.
- Rename
scrapetoobservefor proper naming convention. - Need error handling on
get_web_state()to avoid crashes. - Lacks memory, a
current_statevariable could help. loggingis better thanprintin these circumstances.
- Staring an AI code audit and found the following on
-
17:44
- Refactored
scout.pyand now moving on toget_state.py. get_state.pyneeds the following changes:- The browser needs a seperate class to persist.
- Add a network idle to ensure all is loaded.
- Returning the
pageas well as the screenshot is better.
- Refactored
-
18:18
- Refactored the
get_state.pymodule now moving ontoprocessing.py. - The following needs to be done for
processing.py:- Keep the overlapping box that makes the most sense.
- Safety check to avoid the
ValueErrorthat would crash the program. - Read image once pass it around for efficiency.
- Refactored the
-
18:50
- Refactored the
processing.pymodule and now moving on to theclassifier.pyafter a short break.
- Refactored the
-
19:56
- The
classifier.pymodule needs the following refactors:- Regex cleaning to strip common OCR hallucinations.
- Log instead of printing.
--psm 6might be bettter than--psm 7for the config.
- The
-
20:17
- Refactored the
classifier.pymodule.
- Refactored the
-
21:56
- About to step away soon, need to finish the artifacts that appear.
2026/02/19
- 21:44
- Picking away at finding each element with many filters.
2026/02/23
- 21:11
- Created the
ARCHITECTURE.mdfile so I can stop playing with values and try a more calculated aproach... Which I should have done to begin with.
- Created the
2026/02/24
- 05:22 A rewrite based on the architecture has been finished and the elements are properly found.