File Parser to Scan CSV.GZ Files and Calculate Unique IP Addresses

$10-100 USD

Completed
Posted about 5 years ago

Paid on delivery

Problem: I have a large number of [login to view URL] files that each contain CSV data. There are about 250 of these files, and each one is about 100 MB in size (compressed). The CSVs contain lists of IP addresses in column A, and I would like to know how many unique IP addresses there are in the files (in total, not per file).

Solution: I would like a file parser written in Python that will scan the [login to view URL] files and tell me how many unique IP addresses exist within the CSV data. By this I mean the total number of unique IP addresses across ALL the data files, not just the unique IP addresses within each individual file. Thank you for your assistance.

Please include a brief description in your PM along with your bid so I can tell that you actually read the project description instead of using an auto-bidder to bid on the project. Just a sentence or two will do.

Project ID: 18989668

About the project

15 proposals
Remote project
Active 5 years ago

Hello sir, I can parse your files for IP addresses as you like. Please take a look at my reviews and portfolio. I am interested in your project as well. I would like to discuss details via PM. I look forward to hearing from you soon. Best regards,
$100 USD in 1 day
5.0 (43 reviews)
5.9
15 freelancers are bidding an average of $68 USD on this job
A Bash script would be a better and faster option. It will extract each file and append the IPs to a temporary file, then we will sort them uniquely using sort -u -k...
$40 USD in 1 day
4.9 (556 reviews)
7.6
Hi, nice to meet you. I checked your description. I have a similar script for processing .gz files. I will use the NumPy library for your project. Just getting the total unique count is not a problem. My question is how you will share those CSV files so I can test properly. Regards, Lian
$100 USD in 10 days
4.9 (119 reviews)
6.9
This is about the project you posted: it can also be done in Java. We can do all the tasks you mentioned here. Hello, warm greetings! I am a Java developer who has worked with Java technology for 7+ years, with hands-on Windows and web development experience. I would like to help with your application development. It would be better if you could share more details about it if you are interested. Please message me to discuss the requirements further. Budget is negotiable. Looking forward to your positive response. Thanks, Namit
$70 USD in 10 days
4.9 (81 reviews)
5.8
Hey there, I can develop the CSV parser to count unique IP addresses. I'm a systems engineer with coding skills. I have developed tons of Python scripts. Would you share more details? Regards.
$100 USD in 10 days
4.7 (29 reviews)
5.4
Hi, I'm not strong in Python, but I could write this for you in Golang and compile it for Windows or Linux. I've done work parsing and analysing BGP router views logs, so I have previous experience working with large amounts of IP data. I can read the data from the decompression stream and use a radix tree for optimum speed and minimum memory usage. I should be able to complete this today.
$55 USD in 1 day
5.0 (6 reviews)
4.3
Hi, I checked your requirement and I'm sure I can do it well. Python's gzip library supplies methods for working with large gzip files, including reading all lines from them. Please give me an opportunity. I will do it perfectly.
$30 USD in 5 days
4.4 (62 reviews)
5.1
Hey there. Your project looks straightforward. I have done a lot of work with Python and CSV files before, and even though I haven't worked with gzipped data, I don't see that being a problem. Even though 100 MB isn't much, I think I'll implement lazy reading so the entire file is not read into memory at once. Let us discuss this further to see if I am the right person for this job.
$100 USD in 7 days
4.9 (4 reviews)
3.4
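
The lazy-reading idea in this bid could look roughly like the sketch below. It is only an illustration, not any bidder's actual script: the function name `count_unique_ips` and the `*.csv.gz` glob pattern are assumptions, as is the premise that the files have no header row and keep the IP address in the first column.

```python
import csv
import gzip
from pathlib import Path


def count_unique_ips(directory, pattern="*.csv.gz"):
    """Count distinct first-column values across all matching files.

    Hypothetical helper: the pattern and the "no header row" premise
    are assumptions, not details confirmed by the job description.
    """
    unique_ips = set()
    for path in sorted(Path(directory).glob(pattern)):
        # Mode "rt" decompresses on the fly and yields text; csv.reader
        # pulls one row at a time, so no file is fully loaded into memory.
        with gzip.open(path, "rt", newline="") as handle:
            for row in csv.reader(handle):
                if row and row[0].strip():
                    unique_ips.add(row[0].strip())
    # One shared set gives the total across ALL files, not a per-file count.
    return len(unique_ips)
```

Only the set of distinct addresses stays in memory here, so memory use grows with the number of unique IPs rather than with the total size of the files.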
Greetings. There are two ways we can solve this task: 1) Dump all the data into an SQLite DB and fetch all unique IPs via SQL. 2) Iterate over every CSV file and add only unique IPs to a global variable, but this will consume more memory.
$66 USD in 2 days
5.0 (5 reviews)
3.0
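
A rough sketch of the first option, using Python's standard `sqlite3` module, might look like this. It is a variant rather than the bidder's exact plan: instead of dumping every row and deduplicating afterwards, it assumes a single-column table with a primary key and uses `INSERT OR IGNORE` so duplicates are dropped on insert. The function name, table name, and `*.csv.gz` pattern are hypothetical.

```python
import csv
import gzip
import sqlite3
from pathlib import Path


def count_unique_ips_sqlite(directory, db_path=":memory:", pattern="*.csv.gz"):
    """Count distinct first-column values by loading them into SQLite."""
    conn = sqlite3.connect(db_path)
    try:
        # The PRIMARY KEY creates a unique index, so a repeated IP
        # conflicts on insert and OR IGNORE silently skips it.
        conn.execute("CREATE TABLE IF NOT EXISTS ips (ip TEXT PRIMARY KEY)")
        for path in sorted(Path(directory).glob(pattern)):
            with gzip.open(path, "rt", newline="") as handle:
                rows = (
                    (row[0].strip(),)
                    for row in csv.reader(handle)
                    if row and row[0].strip()
                )
                conn.executemany("INSERT OR IGNORE INTO ips (ip) VALUES (?)", rows)
            conn.commit()  # one transaction per file
        (count,) = conn.execute("SELECT COUNT(*) FROM ips").fetchone()
        return count
    finally:
        conn.close()
```

Passing a file path as `db_path` instead of the default `":memory:"` keeps the deduplicated addresses on disk, which is the main reason to prefer this over an in-memory collection.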
I am a database / business intelligence architect with more than 14 years of experience in the IT industry. I can achieve this using the SSIS ETL tool. Let me know if you are OK with that, then we can talk further.
$88 USD in 2 days
5.0 (4 reviews)
3.2
How's it going? No, this is not an auto-bid. I hope this is enough to prove I've read your project description. I'm Sky, a German Development Engineer with more than 6 years of experience in Software Engineering. I would like to write the Python script for you. Python offers a great data structure that already checks whether an entry exists or not. This way, we can determine the unique IP addresses. Best Regards, Sky Haubrich
$35 USD in 0 days
0.0 (0 reviews)
0.0
I have been working in Python for the past year. I also contributed to an open source org, SymPy, in the past. Relevant Skills and Experience: Previously worked on the Python-built SymPy (open source). Used Python for ML purposes too.
$61 USD in 10 days
0.0 (0 reviews)
0.0
Hi, hope you are doing well. I should be able to deliver this project in a day at most. I understand the requirement. (IP address: is it IPv4 or IPv6?) I will be using Python as the language for implementing this. We can discuss this further if required. My Skype ID: live:d311a9e099b4554 Thanks, Ram
$66 USD in 1 day
5.0 (1 rating)
0.0
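
The IPv4/IPv6 question matters for counting, because the same IPv6 address can be written in several textual forms. One possible way to handle both families, sketched here with Python's standard `ipaddress` module, is to normalize each value before deduplicating. The helper names are hypothetical, and silently skipping unparseable values (such as a header cell) is an assumption about the desired behavior.

```python
import ipaddress


def normalize_ip(text):
    """Return a canonical string for a valid IPv4/IPv6 address, else None."""
    try:
        # str() of the parsed address gives one canonical spelling, e.g.
        # compressed, lowercase notation for IPv6.
        return str(ipaddress.ip_address(text.strip()))
    except ValueError:
        return None


def count_unique_normalized(values):
    """Count distinct addresses, ignoring values that are not IP addresses."""
    seen = set()
    for value in values:
        canonical = normalize_ip(value)
        if canonical is not None:
            seen.add(canonical)
    return len(seen)
```

For example, `2001:DB8:0:0:0:0:0:1` and `2001:db8::1` are counted as one address with this approach, whereas a plain string comparison would count them as two.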
Hi. Your task is interesting to me. Python is my third language and I look for more practice with it everywhere. I will be glad to do this task for you. P.S. My bid is 55 and approximately 1-2 days for work, testing and correction.
$55 USD in 2 days
5.0 (1 rating)
0.0
I am an embedded software engineer working at Stakrbits company, programming in C, C++ and Python.
$55 USD in 10 days
0.0 (0 reviews)
0.0

About the client

New York, United States
5.0
154
Payment method verified
Member since Jul 14, 2011
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)