Find Jobs
Hire Freelancers

Problem solution for Spark use case

$750-1500 USD

Lukket
Slået op over 6 år siden

$750-1500 USD

Betales ved levering
1) SO I have 60 Millions usa and canada postals Created dataframe customer_df => good and bad => 60M 2) downloaded some Good postals from internet to filter the customer_df data Created one dataframe good_df => which is good postals => 1M 3) Perfomed Join between customer_df and good_df wiht zipcode to seperate the good values filter_df = good zip [login to view URL](cus_df,zipcode) 4) Then seperated bad data with the below logic bad_df = [login to view URL](filter_df) Now still we can filter bad_df with city names city_df = [login to view URL](bad_df,city) Then did unioin between both df's total_filter = [login to view URL](city_df) it taking 1.30 mints (used spark with 8 node cluster each node 32 gb => spark-submit driver memory -8g and num-executors - 8 and executor-memory- 8g) any other technology or any other tool to clean-up the data within 15 to 20 mints(again customer data is 60M
Projekt-ID: 14803384

Om projektet

16 forslag
Projekt på afstand
Aktiv 7 år siden

Leder du efter muligheder for at tjene penge?

Fordele ved budafgivning på Freelancer

Fastsæt dit budget og din tidsramme
Bliv betalt for dit arbejde
Oprids dit forslag
Det er gratis at skrive sig op og byde på jobs
16 freelancere byder i gennemsnit $1.101 USD på dette job
Brug Avatar.
I am a data scientist and have experience with Big Data Technologies like Spark and Hadoop. I also have experience with NoSQL databases like HBase, Cassandra, etc. Previously I have worked with in Spark related projects like - Real Time ClickStream Analysis using Spark Streaming, Twitter Sentiment using SparklyR and others. I also have worked with Messaging Queues like Kafka. I would like to help you. Please provide me more details.
$1.100 USD på 5 dage
5,0 (6 anmeldelser)
4,4
4,4
Brug Avatar.
I propose first analyzing your current algorithm to find the bottleneck and either rewriting your algorithm, reconfiguring your environment or finding other technologies e.g. Impala. Relevant Skills and Experience I have experience working with big data in hadoop clusters using Hive, Apache Pig + Java, Spark and Impala. Proposed Milestones $400 USD - Analysis of current code and environment, list possible solutions in order of priority $850 USD - Test candidate solutions and implement best solution
$1.250 USD på 20 dage
5,0 (3 anmeldelser)
3,2
3,2
Brug Avatar.
Hello, I am 7+ years experienced Big data developer and I understand the job and will provide the desired solution. Please spare some time to discuss further. Relevant Skills and Experience My Key skills are: Java J2EE, HBase, Hive, Pig, Cassandra, Spark, Hadoop, Cassandra, Scala, MongoDB and latest cutting edge technologies. Proposed Milestones $1079 USD - Big data developer
$1.079 USD på 12 dage
4,7 (4 anmeldelser)
3,5
3,5
Brug Avatar.
Hi, We are 5 big data enthusiasts with expertise in core technologies like Hadoop,spark,mongodb,hive,pig,R,etc. All of us have the development experience on platforms like Scala,python and java. Our vision is to deliver best solutions to our clients with great team work and dedication. To know more, kindly check our profile Thanks, Team-UBF
$750 USD på 10 dage
4,9 (6 anmeldelser)
3,4
3,4
Brug Avatar.
Hello Sir... I have a very good experience in Spark & Scala. Please contact me for more details when possible. I look forward to work for you Sir. Best Regards. Relevant Skills and Experience I am a computer science tutor, I teach (among others) Data analysis and Algorithms. Proposed Milestones $750 USD - 1
$750 USD på 15 dage
5,0 (2 anmeldelser)
2,8
2,8
Brug Avatar.
I am new to freelancer but I have been working on field of Big data for more than 3 years. The project description tells you are technical as well. I think the pseudocode can be optimized. Relevant Skills and Experience I have more than 3 years experience on Big data technology like Hadoop, Spark, Cascading, Elasticsearch, Redis, etc. I have worked on several data processing and optimizations problems. Proposed Milestones $833 USD - Project completion using same data and resources (some other technologies can be added) I would like to get data and access to clusters so that I can start working right away.
$833 USD på 10 dage
5,0 (1 bedømmelse)
0,4
0,4
Brug Avatar.
I have experience in tuning and debugging Spark jobs for one of the Fortune 6 companies which processes large amount of data.
$750 USD på 10 dage
0,0 (0 anmeldelser)
0,0
0,0
Brug Avatar.
Hello, With an experience of 7 Years into Java, 3 Years into Hadoop & 1+ year into Spark, excellent solution is guaranteed. Whats your value for "--master" and "--deploy-mode" in spark-submit command Relevant Skills and Experience Spark, Java, RDD Proposed Milestones $250 USD - Discussion on spark command and showing optimization demo $583 USD - Deliver the entire project Whats your value for "--master" and "--deploy-mode" in spark-submit command ?
$833 USD på 20 dage
0,0 (1 bedømmelse)
0,0
0,0
Brug Avatar.
Hello there. I have seen your job posting. I will like to ask some questions. Please come over the chat so we can discuss things. Relevant Skills and Experience All the skills/experience will be discussed/revealed upon chat. Proposed Milestones $625 USD - default $625 USD - default I need to know the technical details of this project. Please provide me my job description.
$1.250 USD på 20 dage
0,0 (0 anmeldelser)
0,0
0,0
Brug Avatar.
I have experience in working with Apache Spark and the manipulate DataFrames and RDD, by means of python
$1.111 USD på 5 dage
0,0 (0 anmeldelser)
0,0
0,0
Brug Avatar.
Hello, i have a lot experience in the field g feel free to ask for my work,............................
$1.500 USD på 20 dage
0,0 (0 anmeldelser)
0,0
0,0
Brug Avatar.
I am a Big Data Engineer certified by Simplilearn Relevant Skills and Experience Big Data Hadoop and Spark Developer Proposed Milestones $750 USD - It will be cleared in 8 days (Only Weekends will be calculated as working day)
$750 USD på 8 dage
0,0 (0 anmeldelser)
0,0
0,0
Brug Avatar.
django PHP Arduino hadoop metatrader web design python machine learning HTML,HTML5 graphic design wordpress Android unity3d Relevant Skills and Experience django PHP Arduino hadoop metatrader web design python machine learning HTML,HTML5 graphic design wordpress Android unity3d Proposed Milestones $1666 USD - full
$1.666 USD på 20 dage
0,0 (0 anmeldelser)
0,0
0,0

Om klienten

Flag for UNITED STATES
United States
0,0
0
Betalingsmetode verificeret
Medlem siden apr. 7, 2017

Klientverificering

Tak! Vi har sendt dig en e-mail med et link, så du kan modtage din kredit.
Noget gik galt, da vi forsøgte at sende din mail. Prøv venligst igen.
Registrerede brugere Oprettede jobs i alt
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Indlæser forhåndsvisning
Geolokalisering er tilladt.
Din session er udløbet, og du er blevet logget ud. Log venligst ind igen.