
Closed
Posted
Paid on delivery
I have a growing library of MP4 recordings of casino-grade slot machines and I need a reliable way to turn each session into structured data. For every spin I want the script to capture the start time, end time, bet size, win amount, any bonus triggers, and jackpots, then tally an overall spin count. The videos cover many different games, so the screen layout shifts from title to title; your solution therefore has to locate each region of interest on its own instead of relying on hard-coded coordinates. Because bonuses and jackpots are marked with visually distinct graphics, you can leverage that cue when classifying events. Accuracy matters more than raw speed, but I still expect processing to run unattended once configured. Please build a computer-vision / OCR pipeline that: • Automatically detects and tracks the changing ROI zones for bet, win, credit, and any bonus or jackpot indicators • Parses the text or numerical values frame-by-frame to create a chronological log of every spin • Outputs clean CSV or JSON per session, plus a summary of total spins, total wagered, total won, and jackpots for tax reporting • Includes a brief README and example command that reproduces your results on my sample MP4s Acceptance criteria • Works on at least three different game layouts without manual retuning • ≥99.95 % accuracy on spin count, bet size and win amount when compared with my hand-labeled ground truth • Correctly flags 100 % of visually distinct bonus/jackpot events in the provided test set Feel free to use OpenCV, Tesseract, YOLO, or any modern deep-learning framework—whatever achieves dependable results.
Project ID: 40438087
102 proposals
Remote project
Active 8 hours ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
102 freelancers are bidding on average $161 USD for this job

Hi there, I have carefully reviewed the requirements for your project involving the automation of slot video data extraction. I understand the need for a reliable script that can accurately extract structured data from MP4 recordings of casino-grade slot machines, including start time, end time, bet size, win amount, bonus triggers, jackpots, and overall spin count. Let's chat and discuss it further. To handle your project, I will start with developing a computer-vision / OCR pipeline using OpenCV and Tesseract. This pipeline will automatically detect and track changing ROI zones, parse text or numerical values frame-by-frame, and generate clean CSV or JSON outputs per session along with a summary of key metrics. The deliverables will include a fully functional script that meets the specified acceptance criteria and provides accurate data extraction from various game layouts. Before signing-off my bid, I would like to ask a question, i.e., how many MP4 recordings are currently in your library for processing? Warm Regards, Aneesa.
$100 USD in 1 day
6.8
6.8

Hi there, I’ve read your Slot Video Data Extraction project and I’m confident we can build a robust CV/OCR pipeline that adapts to various layouts without manual retuning. I’ll start with an ROI discovery module that tracks dynamic regions for bet, win, credit and any bonus or jackpot indicators, then apply frame-by-frame parsing to assemble a chronological spin log with start and end times, bet size, win amount, and event markers. I’ll leverage a lightweight detector for layout changes and a tolerant OCR pipeline, so accuracy stays high even when graphic styles shift across games. I’ll value accuracy over speed and design an unattended pipeline that runs on a schedule, with a simple command to reproduce results on your sample MP4s. I’m interested in the project and have several experiences with similar automation tasks, including building end-to-end CV/OCR data extractors for multi-layout video sources and exporting clean CSV/JSON outputs with summaries. I’ll deliver a README with setup instructions and an example command, plus a per-session JSON/CSV output and a concise tax-ready summary of spins, wagered, won, and jackpots. For next steps I can draft a minimal MVP in about 7-10 days, then iterate on accuracy with your ground-truth data.
$155 USD in 12 days
5.9
5.9

Hello, I understand you need a robust computer-vision system to extract structured spin-by-spin data from MP4 slot machine recordings, including start/end time, bet, win, bonuses, jackpots, and total spin counts across multiple games with shifting layouts. Accuracy is critical, and the solution must work without hard-coded screen coordinates. I will build a dynamic CV + OCR pipeline using OpenCV with YOLO-based detection to automatically locate UI regions (bet, win, credits, bonus/jackpot indicators) across different slot layouts. Frame tracking and temporal logic will reconstruct each spin sequence, while OCR (Tesseract/EasyOCR) will extract numeric values reliably. Bonus/jackpot events will be detected using visual classification patterns, and all data will be structured into clean CSV/JSON outputs with full session summaries (total spins, wagered, won, jackpots). The system will be fully automated, tested across multiple game layouts, and designed for high accuracy (99.95% target alignment with ground truth). I will also include a clear README, reproducible script, and optimized pipeline for unattended batch processing of MP4 files. I’m ready to start immediately and tune the model based on your sample videos. Thanks, Asif
$250 USD in 3 days
5.7
5.7

&& YOLO, OCR, OpenCV, Tensorflow, PyTorch, Keras, ML/DL model && Hi, How are you?. I have full skills and full experiences of this field. I have developed many Image Processing project and I am expert in these fields I can finish your project with high quality and on time. Please send me your message to discuss more about your project. I am waiting your reply now. Thanks.
$140 USD in 3 days
5.8
5.8

I can build a robust OpenCV/OCR pipeline using YOLO + Tesseract/PaddleOCR to dynamically detect slot-machine UI regions, track spins across changing layouts, classify jackpots/bonuses, and export high-accuracy CSV/JSON session reports automatically.
$140 USD in 1 day
5.4
5.4

Hi there, I will build a CV/OCR pipeline that dynamically detects ROI zones across varying slot layouts — extracting bet size, win amount, spin timing, and bonus/jackpot flags — then outputs per-session CSV/JSON with tax-ready summaries. For reliable ROI detection across different games, I will use template-free contour analysis paired with temporal differencing to isolate value regions, then run Tesseract with digit-tuned configs for high-accuracy numeric extraction. Questions: 1) How many distinct game titles are in the current library? 2) What resolution and frame rate are the MP4 recordings? Looking forward to potentially working together. Thanks, Kamran
$90 USD in 5 days
5.6
5.6

Hi, I am a computer vision and automation developer with 8 years of rich experience. I am familiar with Python, OpenCV, OCR, Deep Learning and Computer Vision. For this project, the most important issue is building a robust pipeline that detects dynamic regions, parses spin data frame-by-frame, and outputs structured CSV/JSON with accurate bets, wins, and bonus events. I'm an individual freelancer and can work on any time zone you want. Please contact me with the best time for you to have a quick chat. Looking forward to discussing more details. Thanks. Emile.
$250 USD in 7 days
5.2
5.2

Hello, I can develop a computer vision/OCR pipeline that will meet all your requirements. I’d love to chat further about the project and how we can move forward. I believe in clear communication and close collaboration, so you’ll always stay updated throughout the process to ensure the final pipeline matches exactly what you’re looking for. Best regards, Fahad.
$100 USD in 1 day
5.2
5.2

Hi, I can build a robust computer-vision + OCR pipeline for your slot machine MP4 analysis that automatically extracts structured spin-by-spin data across different game layouts. I have experience with: OpenCV-based video frame processing OCR pipelines (Tesseract / EasyOCR / deep-learning OCR) YOLO-style object detection for dynamic UI elements Event detection and time-series reconstruction from video Building offline, automated data extraction workflows
$220 USD in 2 days
4.8
4.8

✋ Hi There!!! ✋ The Goal of the project:- TO AUTOMATE EXTRACTION OF STRUCTURED DATA FROM CASINO SLOT VIDEO RECORDINGS WITH HIGH ACCURACY I have carefully read and understood your project description and can develop a Python computer vision and OCR pipeline using OpenCV, Tesseract, and YOLO to detect ROIs dynamically, parse bets, wins, bonuses, and jackpots, and output CSV/JSON logs with summary statistics. I am the best fit because I have 9+ years experience as a full stack developer with extensive work in data extraction and CV automation. I match your requirements: dynamic ROI detection for varying layouts, accurate frame-by-frame OCR for bet/win amounts, and detection of visually distinct bonus/jackpot events. I provide UI design for pipeline monitoring, database management for logs, testing, and full source code delivery with README and sample runs. Looking forward to chat with you for make a deal Best Regards Elisha Mariam!
$111 USD in 11 days
4.6
4.6

Hi,I am a seasoned Applied ML Engineer(6+ yoe) & I can build this as a robust video-to-structured-data pipeline for slot-machine MP4s,combining computer vision,OCR,event detection,& validation logic instead of relying on fixed screen coordinates Proposed Approach: >>Visual Detection:Identify game anchors (reels,buttons,banners) using YOLO/OpenCV for dynamic ROI localization without hard-coded coordinates >>OCR & Extraction:Implement PaddleOCR/Tesseract with VLM fallbacks to extract numeric & text data from complex UI states >>Event Processing:Refine raw OCR into reliable events via temporal smoothing,state-machine logic & regex validation to track spin boundaries >>Classification:Detect bonus & jackpot triggers using visual templates & trained classifiers for distinct graphics >>Reporting & QA:Generate session-level CSV/JSON logs (bet,win,totals) & benchmark predictions against ground truth with uncertainty flagging Relevant experience: >>Built OCR-heavy pipelines using PaddleOCR/OpenCV,YOLO,frame cropping,tracking,& event logging >>Worked on related projects such as ANPR,marathon bib detection,document/text extraction,video-based object tracking,& structured data generation from visual streams I would deliver source code,model/ROI detection logic,OCR post-processing,example commands,sample outputs,& a validation summary against your labeled MP4s
$250 USD in 7 days
4.4
4.4

Hello there, we are a team of senior Full Stack Web and Mobile App Developers and we can do this project in no time. Please, send me a message to discuss the work. Thanks Ashish Kumar.
$140 USD in 7 days
4.3
4.3

Hi, I can build a robust OpenCV/OCR pipeline that automatically detects dynamic ROIs across different slot-machine layouts, extracts spin/bet/win/jackpot data, and outputs clean CSV/JSON session reports. My approach would combine ROI detection/tracking, OCR validation, and visual-event classification (bonus/jackpot detection) to achieve high accuracy across multiple game themes without hard-coded coordinates. You’ll receive a reproducible pipeline with README, CLI commands, and structured summaries for spins, wagers, wins, and jackpots, designed for unattended batch processing on MP4 libraries.
$140 USD in 9 days
4.0
4.0

Hi, this is a strong fit for a Python computer-vision/OCR pipeline, not just a simple screen scraper. I’d approach this by first building a small validation set from your labeled MP4s, then detecting stable UI anchors and dynamically locating the bet/win/credit/bonus regions per game layout. The main risk is false reads from animations, flashing text, or layout changes, so I’d avoid relying on single frames and instead use frame windows, confidence scoring, OCR cleanup, and event-state tracking before writing each spin to CSV/JSON. For example, if a win amount appears only briefly after a spin, the pipeline should track the spin state, read the value across several frames, and only commit the result once the same value is confirmed. I can build this with Python, OpenCV, OCR, and object detection where needed, plus a clear README and repeatable command for your samples. Thanks!
$220 USD in 7 days
4.0
4.0

⭐⭐⭐⭐⭐ ✅Hi there, hope you are doing well! I recently completed a project automating data extraction from video footage of gaming machines, where I developed a robust OCR and computer vision pipeline that accurately parsed changing screen elements with ease. From my experience, the key to success in this project is dynamically detecting and tracking regions of interest across varied layouts to ensure precise data capture without hard-coded coordinates. Approach: ⭕ I will employ advanced object detection models like YOLO to dynamically locate ROIs for bets, wins, and bonuses. ⭕ Apply frame-by-frame OCR using Tesseract with preprocessing in OpenCV to accurately read numerical data. ⭕ Use image classification techniques to identify and flag visually distinct bonus and jackpot events reliably. ⭕ Aggregate parsed data into clean CSV/JSON outputs with comprehensive summaries for tax reporting. ⭕ Deliver a detailed README with sample commands for seamless reproduction. ❓ Can you provide sample videos for initial testing and labeling? ❓ Are there any preferred formats or specific summary metrics beyond those listed? ❓ Would you like real-time processing or batch mode? I am confident in delivering a high-accuracy, fully automated solution tailored to varied slot game layouts that meets your stringent accuracy and reporting requirements. Best regards, Nam
$200 USD in 3 days
3.8
3.8

For accurate detection of regions of interest (ROI) across different game layouts, I recommend leveraging a combination of OpenCV for computer vision tasks and Tesseract for OCR capabilities. How familiar are you with these frameworks? I’ve developed similar pipelines that autonomously track changing ROIs without relying on hard-coded coordinates, utilizing visually distinct elements for event classification. My approach will ensure a chronological logging of every spin, outputting structured data in CSV or JSON format, alongside a comprehensive summary for tax reporting. I’ll also provide a README with clear documentation for replicability. With extensive experience in computer vision and a focus on accuracy, I’m confident in meeting your requirements of ≥99.95% accuracy on key metrics. I’m ready to discuss your project further and address any specific needs you may have. Best Regards,
$150 USD in 5 days
3.6
3.6

Hi, I’m an experienced computer vision and OCR developer with strong experience turning video recordings into structured CSV or JSON data. I can build an automated pipeline for your slot machine MP4 sessions that detects spin start and end times, bet size, win amount, bonus triggers, jackpots, and total spin count without relying on fixed screen coordinates. I’ve done similar projects involving OpenCV based video parsing, OCR cleanup, dynamic ROI detection, event classification, frame sampling, template matching, and validation against hand labeled ground truth. For your project, I can combine OpenCV, OCR, and if needed YOLO based detection to locate bet, win, credit, bonus, and jackpot areas across different game layouts. The system will process videos unattended, generate per session logs, and summarize total spins, total wagered, total won, and jackpot events for reporting. I’ll also provide a README, example command, and testing process using your sample MP4s to verify accuracy across at least three layouts. Best regards, George
$140 USD in 7 days
3.6
3.6

Dear Sir, I am thrilled to bid your project. I can build a computer-vision and OCR pipeline that converts your MP4 slot-machine sessions into structured CSV/JSON logs with spin timing, bet size, win amount, bonus triggers, jackpots, and summary totals. I have experience with OpenCV, OCR preprocessing, video frame analysis, object detection, template matching, YOLO-style models, and automated data extraction from changing UI layouts. My approach would be to first analyze your labeled sample videos, detect screen regions dynamically using visual anchors instead of hard-coded coordinates, then track bet, win, credit, spin state, bonus, and jackpot indicators across frames. For accuracy, I would combine ROI detection, frame sampling, OCR cleaning, value validation, event-state logic, and comparison against your ground-truth labels. The output will include chronological per-spin records, session summaries, and a README with example commands to reproduce results on your MP4s. One important question: do your sample videos include consistent button/spin animations or reel-stop transitions that can be used as the primary spin boundary signal? This is crucial because reliable spin start/end detection is usually the key to reaching your 99.95% accuracy target. I can deliver a configurable unattended pipeline and test it across at least three different game layouts. Sincerely, Adison.
$140 USD in 7 days
3.5
3.5

Dynamic ROI detection across multiple game layouts is the hardest part of this spec, and it is exactly where most computer-vision pipelines break. Here is how I handle it. Instead of hard-coded coordinates, I use a two-stage approach: a lightweight YOLO detector trained on generic slot UI elements (credit panel, bet field, win display, bonus badge) to locate each region dynamically, then EasyOCR running frame-differencing to trigger reads only when a value actually changes, which keeps the pipeline fast while hitting your accuracy target. For bonus and jackpot classification, the visual distinctiveness you described lets me use color histogram thresholds combined with a secondary YOLO class, giving deterministic 100% recall on flagged events. Output is CSV and JSON per session plus the summary fields you listed (total spins, wagered, won, jackpots) formatted for tax reporting. Everything ships with a README and a single command to reproduce results on your sample MP4s. Before I scope the final solution: how many distinct game titles are in your test set, and do you have frame-level ground truth or only spin-level labels? That determines whether I need a fine-tuning pass or can ship with a zero-shot baseline.
$100 USD in 4 days
3.6
3.6

Drawing from my extensive background as a full-stack engineer where I've spent the last 6+ years building and implementing sophisticated web applications, I am well-equipped to tackle the complexities of your project. My experience spans the entire process from requirements gathering to deployment, which matches perfectly with your need for a solution that runs unattended once configured. Automating business processes is not just second nature to me but a key strength that stems from my deep understanding of clean architecture, performance, and reliability. Efficiency and accuracy are pivotal for your project, and this particularly resonates with my focus on reducing manual work and operational errors through workflow automation. I have considerable experience with IT solutions like n8n and Zapier that connect systems for streamlined processes. This aligns well with your vision of a computer vision / OCR pipeline solution for extracting structured data which essentially involves maximizing data throughput while ensuring reliability. In addition to these skills, I also boast experience in data pipelines, analytics, and ML components - tools that will prove useful for reporting, dashboards, or forecasting should you need them in the future. This all-roundedness enhances the value-addition quotient I can bring to our collaboration. Together we can not only automate this task but also lay a foundation that can be expanded for any analytical needs in future!
$140 USD in 7 days
3.6
3.6

McKinney, United States
Payment method verified
Member since Jun 7, 2007
$30-100 USD
$30-100 USD
$100-300 USD
₹12500-37500 INR
$10-30 USD
₹750-1250 INR / hour
₹37500-75000 INR
$10-25 AUD
$15-25 USD / hour
₹1500-12500 INR
₹1500-12500 INR
min £36 GBP / hour
₹750-1250 INR / hour
₹750-1250 INR / hour
$25-50 USD / hour
$250-750 USD
₹100-400 INR / hour
₹750-1250 INR / hour
₹600-1500 INR
$250-750 USD
₹12500-37500 INR
$30-250 USD
₹1500-12500 INR