Big Data Engineer (Remote)
Company Name:-
CrowdStrike
Job Location:-
Crescent City, CA 95532
Job Summary:-
At CrowdStrike were on a mission – to stop breaches.
Our groundbreaking technology, services delivery, and intelligence gathering together with our innovations in machine learning and behavioral-based detection, allow our customers to not only defend themselves, but do so in a future-proof manner.
Weve earned numerous honors and top rankings for our technology, organization and people clearly confirming our industry leadership and our special culture driving it.
We also offer flexible work arrangements to help our people manage their personal and professional lives in a way that works for them.
So if youre ready to work on unrivaled technology where your desire to be part of a collaborative team is met with a laser-focused mission to stop breaches and protect people globally, lets talk.
About the Role:
We are looking to hire a Big Data Engineer for the Data Engineering team at CrowdStrike.
The Data Engineering team operates within the Data Science organization, and provides the necessary infrastructure and automation for users to analyze and act on vast quantities of data effortlessly.
The team has one of the most critical roles to play in ensuring our products are best-in-class in the industry.
You will interact with product managers and other engineers in building both internal and external facing services.
You will:
Write jobs using P y S park to process billions of events per day
Fine tune existing Hadoop / S park clusters
Rewrite some existing PIG jobs in P y S park
Key Qualifications:You have:
BS degree in Computer Science or related field
4+ years of relevant work experience
Experience in b uilding data pipelines at scale ( Note: We process over 1 Trillion events per week)
Good knowledge of Hadoop / Spark /Apache Kafka, Python, AWS, PySpark and other tools in the Big Data ecosystem
Good programming skills Python
Operation experience in the tuning of clusters for optimal data processing
Experience in b uilding out ETL jobs at scale
Good knowledge of distributed system design and associated tradeoffs
Good knowledge of CI / CD and associated best practices
Familiarity with Docker-based development and orchestration
Bonus points awarded if you have:
Created automated / scalable infrastructu
FOR MORE DETAILS CLICK BELOW LINK [convertful id=”110657″]