Data Engineer
Company Name:-
Spokeo
Job Location:-
Pasadena, CA 91101 (South area)
Job Summary:-
Job detailsJob TypeFull-timeFull Job DescriptionSpokeo is a people search engine that both enlightens and empowers our customers.
With over 12 billion records and 18 million visitors per month, we reconnect friends, reunite families, prevent fraud, and more.
Every day our nimble team takes on enormous challenges in data science that push the limits of the cloud and search architecture.
As a Data Engineer at Spokeo, you will be responsible for developing, optimizing and maintaining the ETL data pipeline.
This involves working with infrastructure built in AWS, including Spark EMR, S3 and DynamoDB.
Additionally, this role will help build analytical tools, develop unit and stress tests, and create automation surrounding the orchestration of the ETL data pipeline.
Responsibilities:
Build infrastructure and automation for the extraction, preparation, and loading of data from various sources
Create unit and stress test components to monitor technical performance and ensure identified issues are resolved
Build and maintain analytical tools to provide data insight and capture key metrics
Automate and integrate new components into the data pipeline.
Utilize best practices for data governance, data quality, data cleansing, and other ETL related activities.
Maintain technical documentation
Requirements:
2+ years development experience in data engineering
1+ years professional experience working in big data ecosystems, preference for Spark
1+ years professional experience working with dataflow management tools, such as Airflow
1+ years experience working with Pentaho (or equivalent tools such as Talend, DataStage, and Informatica)
Hands-on scripting experience with Python, Scala and/or shell scripting
Preference for development experience in highly-scalable, distributed systems and cluster architectures (e.
g.
AWS, Azure, Google Cloud, etc)
Familiarity with complex NoSQL databases (e.
g.
DynamoDB, Cassandra, Elasticsearch, etc)
Prior experience working with large data sets (>1M+ records)
B.
S.
preferred in Computer Science, Information Systems, or related fields (foreign education equivalent accepted)
Privacy Notice for Candidates: https://www.
spokeo.
com/recruiting-policy
Spokeo is an equal opportunity employer.
FOR MORE DETAILS CLICK BELOW LINK