Lukket

Google Cloud DataFlow

The test is a typical de-normalization task that is performed frequently when loading data to BigQuery.

The test itself doesn’t require interaction with BigQuery, as we find that final output of transformed

data to BigQuery is the easy part. The transformations in Google DataFlow are more complex and this is

what we would like you to do.

You’ll be given 3 files in gzip-archived JSON format that we receive from Spotify API: streams, tracks and

users. Your job is to develop two pipelines in Google DataFlow (one in Java and one in Python) to

denormalize these three files into one flat output JSON file.

Evner: Google Cloud Platform

Se mere: python write pdf files google cloud standard library, j2ee google cloud, google cloud print java code, google cloud print api java, google cloud print java, bid projects google cloud, google cloud print java application, print google cloud print java, google cloud dataflow, google-cloud-dataflow python, google cloud dataflow tutorial, spring cloud dataflow task example, google cloud dataflow use cases, name two use cases for google cloud dataflow, google cloud dataflow python, google cloud dataflow python examples, google cloud long-running task, google cloud task delay

Om arbejdsgiveren:
( 0 bedømmelser ) Frisco, United States

Projekt ID: #31491614

1 freelancer byder i gennemsnit $250 på dette job

(2 bedømmelser)
2.8