    900 pyspark jobs found, pricing in EUR

    ...have a high-complexity T-SQL stored procedure used for data analysis that I need translated into PySpark code. The procedure involves advanced SQL operations, temporary tables, and dynamic SQL. It currently handles over 10GB of data. - Skills Required: - Strong understanding and experience in PySpark and T-SQL languages - Proficiency in transforming high complexity SQL scripts to PySpark - Experience with large volume data processing - Job Scope: - Understand the functionality of the existing T-SQL stored procedure - Rewrite the procedure to return the same results using PySpark - Test the new script with the provided data set The successful freelancer will assure that the new PySpark script can handle a large volume of data and maintai...

    €171 (Avg Bid)
    14 bids

    conversion modeling / predictive analytics. The whole department is transitioning to DataBricks. I need help with creating conversion models using pyspark. Compare the results to last year and what could have been a better approach.

    €97 (Avg Bid)
    8 bids

    I'm looking for a data engineer with solid Pyspark knowledge to assist in developing a robust data storage and retrieval system, primarily focusing on a Data Warehouse. Key Responsibilities: - Implementing efficient data storage solutions for long-term retention and retrieval - Ensuring data quality and validation procedures are in place - Advising on real-time data processing capabilities Ideal Candidate: - Proficient in Pyspark with hands-on experience in data storage and retrieval projects - Familiar with Data Warehousing concepts and best practices - Able to recommend and implement appropriate real-time processing solutions - Strong attention to detail and commitment to data quality. Specifically, I have a Jira ticket that consists of creating an application tha...

    €426 (Avg Bid)
    13 bids

    I'm seeking a knowledgeable Databricks Data Engineer to expertly navigate Python and Pyspark programming languages for my project. Your primary task will be to optimize a Delta Live Tables pipeline that handles real-time data processing, optimization, and change data capture (CDC). An extensive working knowledge of the Azure cloud platform is a must for this role. Your understanding and ability to apply crucial elements in these areas will greatly contribute to the success of this project. Applicants with proven experience in this field are preferred. In your proposal, state whether you have DLT experience; otherwise you won't be considered.

    €15 / hr (Avg Bid)
    9 bids

    As the professional handling this project, you'll engage with big data exceeding 10GB. Proficiency in Python, Java, and Pyspark are vital for success as we demand expertise in: - Data ingestion and extraction: The role involves managing complex datasets and running ETL operations. - Data transformation and cleaning: You'll also need to audit the data for quality and cleanse it for accuracy, ensuring integrity throughout the system. - Handling Streaming pipelines and Delta Live Tables: Mastery of these could be game-changing in our pipelines, facilitating the real-time analysis of data.

    €17 / hr (Avg Bid)
    35 bids

    I'm in need of a Machine Learning Engineer who can migrate our existing notebooks from RStudio and PySpark to AWS Sagemaker. Your task will be to: - Understand two models I have running locally. One is a Rstudio logistic regression model, and the other is a pySpark XGboost also running on local. - Migrate These two models to AWS SAGEMAKER. Data will be on S3 -Prepare models to run on sagemaker totally, so that we can do training and testing 100% on sagemaker.-Models are already running on a local computer, but I need to move them to Sagemaker 100%. Data is on S3 already. -You need to configure and prepare Sagemaker from end to end, and teach me how you did it, since I need to replicate it in another system. -I will give you the data and access to AWS Ideal Skills and...

    €206 (Avg Bid)
    13 bids

    The Data Engineer contractor role will be a project based role focused on migrating data pipelines from legacy infrastructure and frameworks such as Scalding to more modern infrastructure we support such as Spark Scala. This role will be responsible for: Analyzing existing data pipelines to understand their architecture, dependenci...Requirements The ideal candidate is a Data Engineer with considerable experience in migrations and Big Data frameworks. Must-Haves Scala programming language expertise Spark framework expertise Experience working with BigQuery Familiarity scheduling jobs in Airflow Fluency with Google Cloud Platform, in particular GCS and Dataproc Python programming language fluency Scalding framework fluency Pyspark framework fluency Dataflow(Apache Beam) framewor...

    €279 (Avg Bid)
    8 bids

    Hi, Please apply only individual. Agency can apply but budget should not be more than mentioned. Role : GCP Engineer (OTP) Exp : 7 + yrs SHIFT: IST Cloud Storage Buckets, BigQuery (SQL, Data Transformations and movement) Airflow (python, DAGs), DBT IAM Policies PyCharm Databricks (pySpark), Azure DevOps Clear and confident communication

    €1309 (Avg Bid)
    8 bids

    I'm in need of a Machine Learning Engineer who can migrate our existing notebooks from RStudio and PySpark to AWS Sagemaker. Your task will be to: - Understand two models I have running locally. One is a Rstudio logistic regression model, and the other is a pySpark XGboost also running on local. - Migrate These two models to AWS SAGEMAKER. Data will be on S3 -Prepare models to run on sagemaker totally, so that we can do training and testing 100% on sagemaker.-Models are already running on a local computer, but I need to move them to Sagemaker 100%. Data is on S3 already. -You need to configure and prepare Sagemaker from end to end, and teach me how you did it, since I need to replicate it in another system. -I will give you the data and access to AWS Ideal Skills and...

    €368 (Avg Bid)
    NDA
    3 bids

    The Data Engineer contractor role will be a project based role focused on migrating data pipelines from legacy infrastructure and frameworks such as Scalding to more modern infrastructure we support such as Spark Scala. This role will be responsible for: Analyzing existing data pipelines to understand their architecture, dependenci...Requirements The ideal candidate is a Data Engineer with considerable experience in migrations and Big Data frameworks. Must-Haves Scala programming language expertise Spark framework expertise Experience working with BigQuery Familiarity scheduling jobs in Airflow Fluency with Google Cloud Platform, in particular GCS and Dataproc Python programming language fluency Scalding framework fluency Pyspark framework fluency Dataflow(Apache Beam) framewor...

    €280 (Avg Bid)
    10 bids

    I am looking for a dedicated specialist well-versed in using Databricks and PySpark for data processing tasks, with a primary focus on data transformation. With the provision of JSON format files, you will perform following tasks: - Carry out complex data transformations - Implement unique algorithms to ensure efficient data processing - Test results against required benchmarks Ideal Skills: - Proficient in Databricks and PySpark. - Must possess a solid background in data transformation. - Experience handling large JSON datasets. The end goal is to achieve seamless data transformation leveraging the power of Databricks and PySpark, enhancing our ability to make informed business decisions. Please provide your completed projects, and the strategies you've used ...

    €43 / hr (Avg Bid)
    29 bids

    ...functions to handle data quality and validation. -Should have good understanding on S3,Cloud Formation, Cloud Watch, Service Catalog and IAM Roles -Perform data validation and ensure data accuracy and completeness by creating automated tests and implementing data validation processes. -Should have good knowledge about Tableau, with creating Tableau Published Datasets and managing access. -Write PySpark scripts to process data and perform transformations.(Good to have) -Run Spark jobs on AWS EMR cluster using Airflow DAGs.(Good to have)...

    €1362 (Avg Bid)
    22 bids

    ...Stay current with new technology options and vendor products, evaluating which ones would be a good fit for the company Troubleshoot the system and solve problems across all platform and application domains Oversee pre-production acceptance testing to ensure the high quality of a company’s services and products Skill Sets: Strong development experience in AWS Step Functions, Glue, Python, S3, Pyspark Good understanding of data warehousing, Large-scale data management issues, and concepts. Good experience in Data Analytics & Reporting and Modernization project Expertise in at least one high-level programming language such as Java, Python Skills for developing, deploying & debugging cloud applications Skills in AWS API, CLI and SDKs for writing applications Knowledge...

    €660 (Avg Bid)
    26 bids

    I am in need of a proficient PySpark coder to aid in debugging errors present within my current code. The main focus of this project is optimization and troubleshooting. Unfortunately, I can't specify the type of errors– I need a professional to help identify and rectify them. If you are an experienced PySpark coder with a keen eye for bug identification and problem solving, I'd appreciate your expertise.

    €6 - €18
    Sealed
    10 bids

    I am in need of a proficient PySpark coder to aid in debugging errors present within my current code. The main focus of this project is optimization and troubleshooting. Unfortunately, I can't specify the type of errors– I need a professional to help identify and rectify them. If you are an experienced PySpark coder with a keen eye for bug identification and problem solving, I'd appreciate your expertise.

    €6 - €18
    Sealed
    6 bids

    I'm searching for a PySpark expert who can provide assistance on optimizing and debugging current PySpark scripts. I am specifically focused on PySpark, so expertise in this area is crucial for the successful completion of this project. Key Responsibilities: - Optimizing PySpark scripts to improve efficiency - Debugging current PySpark scripts to resolve existing issues Ideal Candidate: - Proficient with PySpark - Experience in big data management, data ingestion, processing, analysis, visualization, and reporting - Strong problem-solving skills to identify and resolve issues effectively - Knowledgeable in performance tuning within PySpark.

    €94 (Avg Bid)
    65 bids

    I'm looking for a skilled freelancer to create a Spark script that transfers data from a Hive metastore to an S3 bucket. The goal of this project is to enable backup and recovery. Skills and Experience: - Proficiency in Spark and Hive - Extensive experience with S3 buckets - Understanding of data backup strategies Project Details: - The script needs to read the schema and perform metadata transfer for selected schema to s3 bucket. - Only bid if you have work experience with spark, hive, s3 - 4 schemas needs to be migrated - I have already got access to s3 configured - I have local instance of netapp s3 available and bucket created. - Server is Ubuntu

    €91 (Avg Bid)
    10 bids

    I am looking for an experienced data analyst who is well-versed in PySpark to clean up a medium-sized dataset in a CSV file format. The file contains between 10k-100k rows, and your primary role will be to: - Remove duplicate data entries - Deduplicate the dataset - Handle missing values - Aggregate the resultant data Your proficiency in using PySpark to automate these processes efficiently will be critical to the success of this project. Therefore, prior experience in handling and cleaning similar large datasets would be beneficial. Please note, this project requires precision, meticulousness, and a good understanding of data aggregation principles.

    €23 (Avg Bid)
    9 bids

    This vital task entails cleaning and sorting two CSV files of approximately 100,000 rows and second one of about 1.5million rows using pyspark (Python) in Jupyter Notebook(s). The project consists of several key tasks: Read in both datasets and then: - Standardizing data to ensure consistency - Removal of duplicate entries - Filtering columns we need - Handling and filling missing values - Aggregating data on certain groupings as output Important requirement: I also need unit tests to be written for the code at the end. Ideal Skills: Candidates applying for this project should be adept with Pyspark in Python and have experience in data cleaning and manipulation. Experience with working on datasets of similar size would also be preferable. Attention to detail in ensuring ...

    €164 (Avg Bid)
    55 bids

    I'm seeking an experienced Data Engineer with proficiency in SQL and PySpark. Key Responsibilities: - Develop and optimize our ETL processes. - Enhance our data pipeline for smoother operations. The ideal candidate should deliver efficient extraction, transformation, and loading of data, which is critical to our project's success. Skills and Experience: - Proficient in SQL and PySpark - Proven experience in ETL process development - Previous experience in data pipeline optimization Your expertise will significantly improve our data management systems, and your ability to deliver effectively and promptly will be highly appreciated.

    €85 (Avg Bid)
    17 bids

    - Conversion of the entire Python code into PySpark. Skills and experience required: - Proficient knowledge in Python.

    €23 (Avg Bid)
    26 bids

    ...competent in either PySpark or RDD, using Python to create versatile code fitting for several scenarios. Your main task will be to write code to compare rows using Python in line with the clear set of rules I provide. These rules are detailed in an attached Word document and are based on comparisons encompassing specific columns, presence or absence of particular data, and multiple criteria comparisons. The expected output is a reversal logic for claim_opened_timestamp_utc. I need output that are in right side. I need either in pyspark or in rdd to compare rows. spark - spark-3.3.0-bin-hadoop3 py4j-0.10.9.5 I am using I need your support till I execute it in my office computer I need it in 3 days. Ideal Skills and Experience: - Proficiency in Python - Experience with...

    €143 (Avg Bid)
    Urgent
    16 bids

    I'm a beginner user of Azure Databricks and Pyspark. I'm looking to boost my skills to the next level and need an expert to guide me through advanced techniques. Ideal freelancers should have vast experience and profound knowledge in data manipulation using Pyspark, Azure Databricks, data pipeline construction, and data analysis and visualization. If you've previously tutored or mentored in these areas, it'll be a plus.

    €11 / hr (Avg Bid)
    4 bids

    I need 2 small projects completed. The data needs to be pulled from an API using Python. The pulled data needs to be unnested, then transformed to answer some insights with a medallion architecture. Here, you need to showcase SCD Type 2 ingestions, incremental joins, managing PII information, and aggregation. Final deliverable needed for 1st project (Databricks): data model design and architecture overview; notebooks of transformations in Python and PySpark/Spark Scala. Final deliverable needed for 2nd project (dbt): data model design and architecture overview; dbt sql and ...

    €254 (Avg Bid)
    16 bids

    Looking for someone with good skills in Airflow, Pyspark and SQL.

    €227 (Avg Bid)
    12 bids

    I am looking for a skilled professional in Python, with a comprehensive understanding of PySpark, Databricks, and GCP. A primary focus of the project is to build a data pipeline and apply time series forecasting techniques for revenue projection, using historical sales data. Key tasks will include: - Constructing a robust data pipeline using Python, PySpark, and Databricks. - Applying time series forecasting to produce revenue predictions. - Using Mean Squared Error (MSE) to measure model accuracy. The ideal candidate for this project would have: - Proven experience with Python, PySpark, Databricks, and GCP. - Expertise in time series forecasting models. - Practical understanding and use of Mean Squared Error (MSE) for model accuracy. - Experience with large scale ...

    €10 / hr (Avg Bid)
    14 bids

    I am looking to develop a sophisticated and efficient data pipeline for revenue forecasting. This pipeline will be implemented using Python, PySpark, Databricks, and GCP Big Data. Here is what you need to know about this task: - Data Source: The data originates from Google Cloud Platform's Big Data service. As such, the freelancer should have solid experience and understanding of working with Big Data services on GCP. - Data Update Frequency: The frequency of data updates will be confirmed during the project, but suffice to say frequency could be high. Prior experience with real-time or near-real-time data processing will be highly beneficial. - Performance Metrics: The key performance metric I'm focusing on is data processing speed. The freelancer should have a strong kn...

    €17 / hr (Avg Bid)
    13 bids

    I'm in need of a specialist, ideally with experience in data science, Python, PySpark, and Databricks, to undertake a project encompassing data pipeline creation, time series forecasting and revenue forecasting. #### Goal: * Be able to extract data from GCP BigData efficiently. * Develop a data pipeline to automate this process. * Implement time series forecasting techniques on the extracted data. * Use the time series forecasting models for accurate revenue forecasting. #### Deadline: * The project needs to be completed ASAP, hence a freelancer with a good turnaround time is preferred. #### Key Skill Sets: * Data Science * Python, PySpark, Databricks * BigData on GCP * Time series forecasting * Revenue forecasting * Data Extraction and Automation Qualification in...

    €17 / hr (Avg Bid)
    15 bids

    I am looking for a developer to create an AWS Glue and Pyspark script that will strengthen the data management of my project. The task involves moving more than 100GB of text data from a MySQL RDS table to my S3 storage account, on a weekly basis. Additionally, the procured data needs to be written on parquet files, for easy referencing. The developer will also need to send scripts to deploy the AWS Glue pipelines on Terraform, fitting all parameters. Skilled expertise in AWS Glue, PySpark, Terraform, MySQL and experience in handling large data is required. There is no compromise on the quality and completion timeline. Effective performance on this project will open doors to more work opportunities on my various projects.

    €38 (Avg Bid)
    15 bids

    I am seeking a skilled professional proficient in managing big data tasks with Hadoop, Hive, and PySpark. The primary aim of this project involves processing and analyzing structured data. Key Tasks: - Implementing Hadoop, Hive, and PySpark for my project to analyze large volumes of structured data. - Use Hive and PySpark for sophisticated data analysis and processing techniques. Ideal Skills: - Proficiency in Hadoop ecosystem - Experience with Hive and PySpark - Strong background in working with structured data - Expertise in big data processing and data analysis - Excellent problem-solving and communication skills Deliverables: - Converting raw data into useful information using Hive and Visualizing the results of queries into the graphical representation...

    €16 / hr (Avg Bid)
    15 bids

    ...currently searching for an experienced AWS Glue expert, proficient in PySpark with data frames and Kafka development. The ideal candidate will have: • Expertise in data frame manipulation. • Experience with Kafka integration. • Strong PySpark development skills. The purpose of this project is data integration, and we will be primarily processing data from structured databases. The selected freelancer should be able to work with these databases seamlessly, ensuring efficient and effective data integration using AWS Glue. The required work would involve converting structured databases to fit into a data pipeline, setting up data processing, and integrating APIs using Kafka. This project requires a strong background in AWS Glue, PySpark, data frame ...

    €216 (Avg Bid)
    24 bids

    I'm seeking assistance to develop a Python-based solution utilizing PySpark for efficient data processing using the Chord Protocol. This project demands an intermediate level of expertise in Apache Spark or PySpark, combining distributed computing knowledge with specific focus on Python programming. Key Requirements: - Proficiency in Python programming and PySpark framework. - Solid understanding of the Chord Protocol and its application in data processing. - Capable of implementing robust data processing solutions in a distributed environment. Ideal Skills and Experience: - Intermediate to advanced knowledge in Apache Spark or PySpark. - Experience in implementing distributed file sharing or data processing systems. - Familiarity with network communicati...

    €501 (Avg Bid)
    38 bids

    Build a Glue ETL job using PySpark to transfer data from MySQL to Postgres. I am facing challenges in column mappings between the two sources; the target database has enum and text array datatypes. You should solve the errors in column mappings and should have prior experience ingesting data into the Postgres enum datatype.

    €20 / hr (Avg Bid)
    54 bids

    I am in need of an experienced data engineer with specific expertise in PySpark. This project involves the integration and migration of data from structured databases currently housed in AWS. Here's a rundown of your key responsibilities: - Data integration from various existing structured databases - Migration of the combined data to a single, more efficacious database Ideal Candidate: - Proven experience in data migration and integration projects - Expertise in PySpark is indispensable - Proficiency in manipulating AWS databases - A solid understanding of structured databases and various data formats is mandatory This project is more than just technical skills- I'm looking for someone who can understand the bigger picture and contribute to the overarching str...

    €608 (Avg Bid)
    13 bids

    I'm looking for a professional with a strong understanding of PySpark to help transform a dataframe into JSON following a specific schema. This project's main task is data transformation to aid in data interchange. The project requires: - Expertise in PySpark - Proficiency in data transformation techniques - Specific experience in data aggregation For the transformation, I require the application of an aggregation method. In this case, we will be sorting the data. It's crucial that you are skilled in various aggregation methods, especially sorting. Your knowledge in handling critical PySpark operations is crucial for this job's success. Experience in similar projects will be highly regarded.

    €22 (Avg Bid)
    18 bids

    Looking for an expert Azure Data Engineer to assist with multiple tasks. Your responsibilities will include: - Implementing and managing Azure Data Lake and Data Ingestion. - Developing visual reports...platforms to achieve three main objectives: - Perform sophisticated data analysis and visualization. - Enable advanced data integration and transformation. - Build custom applications to meet specific needs. Candidates should have an advanced understanding of Azure Data Lake, Power BI, and Powerapps, bringing a minimum of 6 years experience as Databricks. Proficiency in Python, SQL, PostGre SQL, and Pyspark is also required. Knowledge of GitHub and the CI/CD Process will be beneficial for this role. If you have the skills and expertise needed for this project, I'd love to...

    €31 / hr (Avg Bid)
    28 bids

    ...need to be pushed swiftly to Elasticsearch using Pyspark. Your expertise will help push all data columns from this file into Elasticsearch, establishing a more actionable access to a significant amount of data. Given the project's urgency, I'm expecting a rapid, reliable transition. While the structure for the documents remains undecided due to the project's intricacies, I'm open to suggestions that will make this process more efficient and effective. Anyone with experience in Pyspark, Elasticsearch, and vast data manipulation will have a substantial edge on this project, as these skills are highly necessary for success. A strong understanding of different data structures is also a plus. • Leading Skills Required: Proficiency in Pyspark ...

    €9 / hr (Avg Bid)
    3 bids

    ...Title: Pyspark Data Engineering Training Overview: I am a beginner/intermediate in Pyspark and I am looking for a training program that focuses on data processing. I prefer one on one and written guides as the format for the training. Skills and Experience Required: - Strong expertise in Pyspark and data engineering - Excellent knowledge of data processing techniques - Experience in creating and optimizing data pipelines - Familiarity with data manipulation and transformation using Pyspark - Ability to explain complex concepts in a clear and concise manner through written guides - Understanding of best practices for data processing in Pyspark Training Topics: The training should primarily focus on data processing. The following topics should be cov...

    €21 / hr (Avg Bid)
    72 bids

    ...training is expected to be spread across multiple days. The trainer must have the capability to provide an understanding of the major concepts and components of Apache Spark, with a focus on how to use Databricks and the Pyspark API to manipulate and visualize data. As the training progresses, the instructor should be able to explain how to develop applications using Pyspark and articulate different approaches that a data scientist would use to evaluate and test their models. The instructor should also be able to educate the users on how to deploy and maintain Pyspark applications and how to provide feedback and questions in order to improve their performance. We expect the trainer to be readily available to answer any questions and guide the users along the w...

    €93 (Avg Bid)
    75 bids

    I am seeking an expert in the field to provide remote training in the use of Databricks and Python with PySpark. This is important for developing data processing applications with a high degree of efficiency. The training should cover areas such as data wrangling, machine learning, and Spark streaming. In order to be successful, attendees must be well-versed in Databricks, Python and PySpark, as these skills will be essential for completing the course. The course should provide a good understanding of the concepts and practical application of these tools. This training will give attendees the skills they need to analyse and manipulate large datasets, develop effective data processing pipelines, design powerful machine learning models and build reliable applications that use...

    €99 (Avg Bid)
    74 bids

    ...S3, and RDS; Azure services; and Pyspark data processing and transformations. Essential Skills: - Proficient in AWS, specifically on EC2, S3, RDS with strong understanding of data storage and retrieval. - Expert in Azure services such as Azure SQL Database and Blob Storage. - Highly experienced in writing efficient data transformations using Pyspark. Ideal Experience: - Minimum 7 years in the field with solid experience in technical interviews and coaching. Your task will be to provide actionable insights, best practices, and expert advice to nail my upcoming technical interview. Having been on the other side of the interview table would be an added advantage. - Proven track record of performing successful data processing and transformations using Pyspark. - Prev...

    €14 / hr (Avg Bid)
    8 bids

    Experienced Python + SQL + AWS + Azure data engineer (7+ years) needed for evening IST timings, to guide interview preparation, especially for data engineering. Tasks: Should have good knowledge of PySpark, SQL, and pandas. Should have written multiple ETL pipelines in AWS and Azure. Note: The freelancer must be available during evening IST timings.

    €9 / hr (Avg Bid)
    12 bids

    ...structured data such as SQL databases. Skills and experience required: - Expertise in AWS migration, specifically from another cloud provider - Strong knowledge and experience with structured data, particularly SQL databases - Familiarity with AWS Glue and Athena for data processing and analysis - Ability to work with a combination of different AWS services for optimal performance and efficiency Pyspark ,sql,python Cdk Typescript Aws glue ,Emr and andes Currently Migrating from teradata to aws. Responsibilities: - Migrate data from another cloud provider to AWS, ensuring a smooth transition and minimal downtime - Design and develop applications that utilize AWS Glue and Athena for data processing and analysis - Optimize data storage and retrieval using AWS S3 and R...

    €8 / hr (Avg Bid)
    14 bids

    ...am looking for a skilled and experienced developer to work on a personal project involving the use of CNN by pyspark for analyzing brain and lung cancer. Skills and Experience: - Proficient in using pyspark and CNN - Intermediate understanding of convolutional neural networks - Familiarity with analyzing medical data - Experience in working with cancer-related datasets - Strong problem-solving skills and attention to detail The project requires the use of specific datasets, which I already have. However, any additional assistance in acquiring relevant datasets would be appreciated. The ideal candidate should have a good understanding of CNN and be able to apply it using pyspark. Experience in analyzing medical data and working with cancer-related datasets would ...

    €33 (Avg Bid)
    10 bids

    I am looking for a skilled professional who can help me with a project titled "synapse pyspark delta lake merge scd type2 without primary key". The ideal candidate should have experience and expertise in the following areas: Desired Outcome: - The desired outcome of the merge process is to update existing records and insert new records. Data Quality: - The level of data quality required for the outcome is high integrity, with no duplicates and full accuracy. Handling Historical Data: - There is a specific requirement to keep track of historical changes to the data. Skills and Experience: - Proficiency in Synapse, Pyspark, Delta Lake - Experience with SCD Type 2 implementation - Strong understanding of data integrity and accuracy - Ability to handle historical da...

    €304 (Avg Bid)
    2 bids

    ...Senior Data Engineer who possesses extensive experience and proficiency in a range of key technologies and tools. The ideal candidate should have a strong background in Python, demonstrating skillful use of this programming language in data engineering contexts. Proficiency in Apache Spark is essential, as we rely heavily on this powerful analytics engine for big data processing. Experience with PySpark, the Python API for Spark, is also crucial. In addition to these core skills, we require expertise in AWS cloud services, particularly AWS Glue and Amazon Kinesis. Experience with AWS Glue will be vital for ETL operations and data integration tasks, while familiarity with Amazon Kinesis is important for real-time data processing applications. Furthermore, the candidate should hav...

    €10 / hr (Avg Bid)
    11 bids

    I am looking for an Airflow, GCP, and Python expert to assist me with my project. Candidate should have a good knowledge of DAG, GIT, pandas, agile, pyspark and Airflow.

    €274 (Avg Bid)
    18 bids

    I am looking for a freelancer who can assist me with a Pyspark AWS ML project. The main goal of the project is data processing and transformation. I already have all the data needed for the project. The preferred timeline for this project is flexible. Skills and Experience: - Strong experience with Pyspark and AWS ML - Proficient in data processing and transformation techniques - Familiarity with machine learning model development - Ability to work within a flexible timeline

    €14 / hr (Avg Bid)
    28 bids

    Years of experience: 7+ Location: Remote - India Contract Tenure - 03-06 Months Notice Period - Immediate -15/20 Days Timings : 12pm - 9pm IST M - F AWS Data Engineer Requirements • Collaborate with business an...functions to handle data quality and validation. • Should have good understanding on S3,Cloud Formation, Cloud Watch, Service Catalog and IAM Roles • Perform data validation and ensure data accuracy and completeness by creating automated tests and implementing data validation processes. • Should have good knowledge about Tableau, with creating Tableau Published Datasets and managing access. • Write PySpark scripts to process data and perform transformations.(Good to have) • Run Spark jobs on AWS EMR cluster using Airflow DAGs.(Good to have) &...

    €3028 (Avg Bid)
    16 bids

    Looking for someone who has a good knowledge of Pyspark, Airflow DAGs, GitHub, Pandas and Agile Framework. Overall candidate should be well aware of the data ingestion approach. Knowledge of Google cloud platform is a Bonus

    €273 (Avg Bid)
    24 bids