This repository was archived by the owner on Feb 2, 2024. It is now read-only.

GetLinkfire/python_engineer_task


Python Engineer Exercise

Welcome! We have prepared a little exercise for you.

The task is to build a miniature data pipeline. Since there is no streaming source, pull the sample data set over HTTPS, apply some simplified validation, transformation, and splitting to the data, and write the output to disk.
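As a rough illustration of the "pretend it's a stream" idea, the sketch below reads compressed line-json one record at a time instead of loading the whole file. It assumes the sample file is gzip-compressed, which this README does not state explicitly, and `iter_records` is a hypothetical helper name. The in-memory bytes stand in for the HTTPS response body; in a real run the file-like object could come from `urllib.request.urlopen` called with the sample data URL.

```python
import gzip
import io
import json
from typing import BinaryIO, Iterator


def iter_records(compressed: BinaryIO) -> Iterator[dict]:
    """Yield one parsed record per line without reading the whole file."""
    # Assumption: the payload is gzip-compressed, line-delimited JSON.
    with gzip.open(compressed, mode="rt", encoding="utf-8") as lines:
        for line in lines:
            line = line.strip()
            if line:  # skip blank lines
                yield json.loads(line)


# Stand-in for the downloaded file: two tiny records, gzip-compressed.
sample = gzip.compress(b'{"type": "a"}\n{"type": "b"}\n')
records = list(iter_records(io.BytesIO(sample)))
```

Because `iter_records` is a generator, downstream steps can consume records one by one. A line that is not valid JSON raises `json.JSONDecodeError` in this sketch; whether such lines should go to the dead-letter file is a design decision left open here.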

In more detail, the program should do the following, not necessarily in that order:

  • Fetch our zipped line-json file with sample data from here.
  • Write the output as line-json files again, this time uncompressed (not gzipped).
  • Process one line at a time, pretending that you're processing a stream.
  • Whenever a record has a convvalue, use rates.json and the record's convvalueunit to convert the value to USD, and write the result to a new field, convusdvalue.
  • Split records by values in the type field and create one output file for each type.
  • Validate that the linkid is a valid UUID, using the standard library. Invalid records must go to their own output file, e.g. "deadletters.json".
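The per-record steps above could be sketched roughly as follows. This is one possible shape, not a reference solution. It assumes rates.json maps a currency code to the USD value of one unit (the real layout may differ), and the function names, the sample record, the "click" type, and the "unknown" fallback for records without a type are all hypothetical.

```python
import json
import uuid
from pathlib import Path


def is_valid_uuid(value: object) -> bool:
    """Check a linkid with the standard library's uuid module."""
    if not isinstance(value, str):
        return False
    try:
        uuid.UUID(value)
    except ValueError:
        return False
    return True


def convert(record: dict, rates: dict) -> dict:
    """Return a copy with convusdvalue added when convvalue is present."""
    out = dict(record)
    if out.get("convvalue") is not None:
        # Assumption: rates maps a currency code to USD per unit.
        # An unknown unit raises KeyError in this sketch.
        out["convusdvalue"] = out["convvalue"] * rates[out["convvalueunit"]]
    return out


def route(record: dict) -> str:
    """Pick the output file name for a record."""
    if not is_valid_uuid(record.get("linkid")):
        return "deadletters.json"
    # Hypothetical fallback: records without a type go to unknown.json.
    return f"{record.get('type', 'unknown')}.json"


def write_record(record: dict, out_dir: Path) -> None:
    """Append one record as a single JSON line to its output file."""
    with open(out_dir / route(record), "a", encoding="utf-8") as fh:
        fh.write(json.dumps(record) + "\n")


# Hypothetical sample record and rate, for illustration only.
example = {
    "linkid": "123e4567-e89b-12d3-a456-426614174000",
    "type": "click",
    "convvalue": 10,
    "convvalueunit": "EUR",
}
converted = convert(example, {"EUR": 1.5})
```

Note that `uuid.UUID` is fairly lenient: it also accepts strings without hyphens, with surrounding braces, or with a `urn:uuid:` prefix. If only the canonical hyphenated form should count as valid, one option is to additionally compare `str(uuid.UUID(value))` with the lower-cased input.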

About

Public test assignment for Python Engineers: a small exercise for applicants to solve.
