As researchers, we often hit unexpected roadblocks in the tools we use daily. Abbyy FineReader, a popular Optical Character Recognition (OCR) software, restricts automatic processing of text documents to 5,000 pages per month. While this limit might be sufficient for casual users, it can be restrictive for large-scale projects. In this post, I’ll share a simple Python script leveraging the pyautogui library to automate the GUI interactions with Abbyy FineReader, potentially allowing you to process more than the set limit.
How it Works:
The script uses pyautogui to mimic human interactions with the Abbyy FineReader software. This means it moves the mouse, clicks on specified coordinates, and even types out paths and filenames. Essentially, it automates the entire process of:
Opening files from specified directories.
Running the OCR process on them.
Saving the results in both PDF and HTML formats.
Using the Script:
Setup: Define your directory paths and specify the screen dimensions if they differ from the defaults in the script.
Run: Execute the script. Ensure Abbyy FineReader is not open to start. Once started, don't interrupt or use the computer, as the script will take control of your mouse and keyboard.
Output: Processed files will be saved in the specified formats.
Customization:
Feel free to modify the coordinates based on your screen resolution or if the layout of your Abbyy FineReader differs from mine. You can use tools like the position tracker in pyautogui to identify exact screen coordinates.
Caveats:
This script is sensitive to screen resolution and software layout. Changes in either might necessitate adjustments in the script.
For consistent results, always run the script in the same environment, preferably without other apps running in the background.
Disclaimer: This script is intended for personal use, to help automate repetitive tasks. Make sure you are not violating Abbyy FineReader’s terms of service or any other applicable laws or agreements by using or modifying this script.
Comments You need to have a GitHub Account to comment!
Comments You need to have a GitHub Account to comment!
Post comment