
Igangværende
Slået op
Betales ved levering
Project Overview I have an Excel dataset containing approximately 11,440 licensed plumbing contractors in California (C-36 classification). The dataset includes contractor license information such as business name, city, address, and phone number. I am seeking an experienced data scraping / lead enrichment specialist to locate verified business email addresses associated with these contractors and append them to the spreadsheet. The goal is to identify publicly available contact emails belonging to the contractor’s business and integrate them into the existing dataset. ⸻ Source Dataset The spreadsheet contains approximately 11,440 contractor records and includes fields such as: • License Number • BusinessName • BUS-NAME-2 (DBA name, if present) • FullBusinessName • Mailing Address • City • State • Phone Number • License Status • Bond information To assist bidders in evaluating the project, a sample file containing approximately 200 records will be provided. The full dataset will be provided to the selected freelancer after the project is awarded. ⸻ Scope of Work For each contractor in the dataset: 1. Identify the official business website or domain associated with the contractor. 2. Locate public contact email addresses belonging to the contractor’s business. 3. Extract emails from sources such as: • official company websites (contact/about/footer pages) • Google Business / Google Maps listings • Yelp business listings • Better Business Bureau listings • relevant industry or business directories 4. Verify the email address using an email verification service. ⸻ Important Search Instruction Many California contractors register their CSLB license under a personal name but operate publicly under a DBA company name. The dataset includes additional name fields such as: • BUS-NAME-2 • FullBusinessName When researching each contractor, please search using both: • BusinessName • BUS-NAME-2 / FullBusinessName Use whichever name correctly identifies the contractor’s business online. ⸻ Required Columns to Add Please add the following columns to the spreadsheet and populate them during the enrichment process: • SearchQuery (Business name + city + CA to assist in locating the business online) • SearchName (Preferred search name selected from BusinessName, BUS-NAME-2, or FullBusinessName) • Website (Official website or domain associated with the contractor) • Email (Business contact email) • EmailVerificationStatus (Valid / Risky / Unknown) • SourceURL (Exact webpage where the email was located) All original dataset columns must remain unchanged. ⸻ Critical Quality Requirement! Every email address must include a SourceURL showing the exact page where the email appears. Emails without a verifiable source page will not be accepted. ⸻ Data Authenticity Requirement All emails must come from publicly accessible business sources. The following will not be accepted: • purchased or bulk email databases • AI-generated or guessed email patterns • scraped marketing lists unrelated to the specific contractor • emails without a verifiable source URL If no email address can be found for a contractor, the email field should be left blank. ⸻ Verification Requirement Emails must be verified using a reputable verification method (SMTP verification or a professional verification tool). Verification status must be recorded in the EmailVerificationStatus column. ⸻ Multi-Source Search Requirement If an email address is not immediately visible on the contractor’s website, please check additional public sources before marking the record as “no email found”, including: • Google Business / Google Maps listings • Yelp business listings • Better Business Bureau listings • other reputable business directories The SourceURL column must identify the exact page where the email appears. ⸻ Automation-Friendly Dataset The dataset is structured and consistent and should be suitable for automated enrichment. Each contractor record includes: • BusinessName • BUS-NAME-2 (DBA name if applicable) • FullBusinessName • City • Mailing Address • Phone Number Because the dataset is standardized and organized in Excel format, the project should be suitable for automated scraping or enrichment workflows rather than manual research. Please indicate in your proposal if you plan to use: • automated scraping tools • enrichment platforms • Python scripts or APIs Automation is preferred where possible. ⸻ Quality Expectations Accuracy is more important than raw volume. Typical expected coverage for this type of dataset is approximately 30–60% of businesses. A random sample of approximately 200 records will be reviewed before final acceptance to verify data accuracy. ⸻ Deliverables 1. Updated Excel spreadsheet containing all original rows. 2. Newly added columns populated as described above. 3. Verified email addresses with source URLs. ⸻ Timeline Preferred completion: 3–7 days ⸻ Budget Please submit a fixed-price proposal for enriching approximately 11,440 contractor records. Budget is flexible for experienced scraping specialists who can demonstrate efficient automated workflows. ⸻ Proposal Requirements Please include in your proposal: • tools or methods you plan to use • estimated completion timeline • examples of similar data enrichment or scraping projects ⸻ Note All data must be collected from publicly available sources only. To confirm you have read the full project description, please begin your proposal with the phrase: “PLUMBING DATA.” Let me know your estimated turnaround time and which toolset you prefer, and I’ll share the spreadsheet immediately.
Projekt-ID: 40274449
126 forslag
Projekt på afstand
Aktiv 2 dage siden
Fastsæt dit budget og din tidsramme
Bliv betalt for dit arbejde
Oprids dit forslag
Det er gratis at skrive sig op og byde på jobs