
In Progress
Posted
Paid on delivery
NOTE!! BUDGET IS $30 FOR THIS PROJECT MAX. Description: I need a Python developer to build and polish an automation script for macOS (Apple M2, Sonoma 14.6.1). The script should: Hotkey Trigger – When I press /, the script should run. Screen Capture & OCR – Take a screenshot of my current screen and extract: The question text The multiple-choice options (A, B, C, D, E) The screen coordinates of each choice (so the script knows where to click). AI Integration – Send the question + choices to an AI model (Together AI API, Hugging Face API, or local Ollama model) and get back the predicted correct answer (A–E). Mouse Click – Automatically move the mouse to the detected coordinates of the correct choice and click it. Error Handling – Handle cases where OCR doesn’t detect choices properly. Ensure correct Retina scaling for Mac (so clicks align with the right spot on screen). Configurable AI – Make it easy for me to swap between AI providers (Together AI key, Hugging Face key, or local Ollama). Requirements: Strong experience in Python automation. Familiar with OCR libraries (Tesseract, EasyOCR, or PaddleOCR). Experience with AI APIs (Together AI, Hugging Face, or Ollama). Familiar with PyAutoGUI or similar libraries for mouse/keyboard automation. Must ensure Retina display scaling works properly on macOS. Deliverables: A polished, documented Python script I can run easily from Terminal. Setup instructions (dependencies, virtual environment). Clear README so I can switch between AI backends. Note: This is for my personal tests that I make. No specific time limit. Can you make the script work with both DOM parsing (Playwright/Selenium) as the first method and OCR with bounding boxes as fallback if the site uses images or PDFs? Will the script handle macOS Retina scaling correctly so the cursor lands exactly on the right A/B/C/D option every time? Can you set it so the hotkey is “/” globally (no matter which app/window is active)? Will the script allow me to configure the AI backend (Together AI, ChatGPT/OpenAI, or DeepSeek) via a simple config file or .env? For OCR, will you use something reliable like PaddleOCR or Tesseract, and will it also give me the bounding box coordinates? Can the cursor move automatically to the detected answer without clicking (just hovering over it)?
Project ID: 39726795
8 proposals
Remote project
Active 9 mos ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
8 freelancers are bidding on average $20 USD for this job

Hi client, I'm Denis Redzepovic, an experienced developer with expertise in API Integration, Hugging Face, AI Development, Python, Mac OS, Software Architecture, Documentation, OCR, Automation and Objective C. I have worked extensively on diverse Python projects, ranging from backend development and automation to data processing and API integrations. My deep understanding of Python’s libraries and frameworks allows me to build efficient, scalable, and maintainable solutions. I pay close attention to code quality and performance to ensure your project runs flawlessly. With my solid experience, I’m confident I can deliver results that exceed your expectations. I focus on writing clean, maintainable, and scalable code because I know the difference between 99% and 100%. If you hire me, I’ll do my best until you’re completely satisfied with the result. Let’s discuss your project details so I can tailor the perfect Python solution for you. Thanks, Denis
$30 USD in 1 day
5.2
5.2

Hello! I'm Ryan, and I'm incredibly excited about your macOS quiz automation project! With 18 years of experience in Python development, including extensive work with automation, OCR, and AI integration, I'm confident I can deliver a reliable and effective solution, even within the $30 budget (though I'll admit, it's a tight one!). I've built similar scripts leveraging PyAutoGUI, OCR libraries like Tesseract and PaddleOCR (including bounding box extraction), and various AI APIs including Together AI and Hugging Face. I'm also very familiar with handling macOS Retina scaling, ensuring precise cursor placement. I understand the need for hotkey activation ("/") across all applications, configurable AI backends (via config file or .env), and the prioritization of DOM parsing (Playwright/Selenium) with OCR as a fallback. I can certainly implement the hover-over functionality before clicking. I'm committed to providing a well-documented, polished script with clear setup instructions and ongoing communication throughout the process. Let's discuss how I can bring your vision to life!
$10 USD in 1 day
4.8
4.8

Hi, I’ll build a Python automation script for macOS that triggers on /, captures the screen, extracts question/choices via OCR or DOM (fallback), and integrates with configurable AI backends. The script will handle Retina scaling, provide bounding box coordinates, and support both hover and auto-click on the predicted answer. Delivery includes polished code, setup guide, and README for easy backend switching and reliable use.
$30 USD in 3 days
3.6
3.6

Hello, I can build a Python automation script for macOS that triggers with “/”, captures the screen, extracts question and choices via OCR with bounding boxes, uses DOM parsing as primary and OCR as fallback, integrates with AI backends (Together AI, Hugging Face, Ollama), and clicks the correct answer with proper Retina scaling. It will include error handling, setup guide, and easy config for AI switching. Best regards, Shakila Naz
$20 USD in 7 days
4.0
4.0

I have reviewed project details, I will develop a macOS automation script that captures the screen, extracts question and options via OCR, queries an AI model for the correct answer, and clicks the corresponding choice automatically. Could you please confirm which AI model/API you prefer to use for predictions: Together AI, Hugging Face, or Ollama? Send me a message so we can finalize the details and get started today. Thanks, Raza
$20 USD in 2 days
2.1
2.1

Hello, I’m Raymundo, fullstack developer with 7 years experience. I’ve also worked quite a bit with Python automation, OCR libraries, and APIs, so your project sounds like something I could get running smoothly on macOS. My plan would be to set up a script that listens for your global “/” hotkey, grabs the current screen, runs it through OCR (with PaddleOCR for bounding boxes), and tries DOM parsing first if the page structure allows it. Then I’d wire it up so the AI backend is pluggable through a simple config or .env, returning the answer and moving or clicking the cursor in the right spot while handling Retina scaling. I have some questions regarding your project: do you want the default behavior to be automatic clicking, or just hovering unless you toggle it in the config? And for DOM parsing, is the content always web-based inside a browser, or could it sometimes be native apps too? Happy to make this lightweight and easy to run with clear setup docs. Let me know how you’d like to proceed. Raymundo
$20 USD in 7 days
1.8
1.8

IF YOU NOT HAPPY YOU DONT PAY!!! I am confident that my Python automation skills align perfectly with the requirements of your project. With strong expertise in Python, OCR libraries such as Tesseract and PaddleOCR, AI integration, and experience with PyAutoGUI for mouse/keyboard automation, I can deliver a polished automation script for macOS. By ensuring Retina display scaling works flawlessly, offering the flexibility to switch between AI providers, clear documentation, and setup instructions, I will provide a seamless user experience. The script will feature global hotkey integration, configuration of AI backends via a simple file, and precision in cursor movement to the correct option. With a commitment to error handling and adaptability between DOM parsing and OCR methods, I am dedicated to meeting your specific requirements. Let's discuss further to bring this project to life efficiently and effectively. Kind Regards, Shaylin
$10 USD in 30 days
0.0
0.0

Penryn, United States
Payment method verified
Member since Jul 7, 2025
$10-30 USD
$30-250 USD
$10-30 USD
$10-30 USD
$10-30 USD
$250-750 AUD
€30-250 EUR
$250-750 USD
$30-250 USD
$750-1500 AUD
₹1500-12500 INR
£20-250 GBP
$3000-5000 USD
$250-750 USD
₹12500-37500 INR
₹400-750 INR / hour
₹600-1500 INR
₹1500-12500 INR
₹12500-37500 INR
$25-50 USD / hour
$250-750 USD
₹12500-37500 INR
₹37500-75000 INR
$1500-3000 AUD
₹1500-12500 INR