closed vacancy Senior Data Engineer ETL Pipelines - Python - Cybersecurity - remote
Senior Data Engineer - ETL Pipelines - Python - Cybersecurity
Full-Time | 100% Remote
Austin, TX
Please, no agencies
*Do you like creating pipelines to ingest and maintain complex data sets for use in ML models?
*Have you worked with machine learning teams and data scientists?
*Do you have a passion for using your experience with Python, Spark, or similar technologies like Hive/Kafka/HBase?
*Do you like pirates, and Arrrrr you interested in writing code that provides intrusion detection?
*Do you want to help Google, Microsoft, Salesforce, and Cisco defend against cyber attacks?
*Do you want stock? A fat base? And a bonus?
*And finally, do you want to work for cool managers who care about clean code?
THEN CHECK. THIS. OUT.
Position Overview:
From software hacking to hardware hacking, we help secure everything from cryptocurrency exchanges and space telescopes to autonomous vehicles and the electric grid. Today, our client is investing significant financial and engineering resources to develop a radically new customer experience we call “Security-as-a-Service” to provide customers with a unified, efficient, and data-driven security platform. We’re looking to add the right individual to their growing team supporting the next wave of cybersecurity products and solutions.
As part of that investment, we’re seeking a seasoned Data Engineer with a successful track record in data engineering in a hyper-growth company setting. You will have the opportunity to work with some of the best security engineers in the world, who hail from organizations such as Amazon, CIA, Facebook, Google, Microsoft, NSA, Red Hat, Sun Microsystems, and the US Air Force. As an Inc. Best Places to Work, Inc. 500 | 5000, Cybersecurity 500, and Austin Fast 50 Award recipient, we are seeking an individual who understands the professional and personal growth attached to this opportunity and who has the corresponding internal drive to maximize it.
Career opportunity:
- Join an industry with massive social, economic, and political importance in the 21st century
- Work alongside some of the best and the brightest minds in the security industry
- Leave an indelible mark on a company where individual input has real impact
- Be recognized, internally and publicly, for your contributions in a high-profile position
- Align your career trajectory with a hyper-growth company that is on the move
Core responsibilities:
- Create pipelines to ingest complex data sets into our client’s data stores and maintain them for use in machine learning models
- Create tools to scour the internet to find important security information and ingest it into their infrastructure
- Work with data scientists to create and maintain data ontologies for security
- Create the roadmap of how to continually evolve the data engineering infrastructure and techniques to improve our client’s ability to find security information
- Mentor junior data engineers and teach them how to use data engineering techniques to solve real world problems
- Communicate complex concepts to team members
Accountable for:
- Creation of data engineering pipelines to find and ingest security vulnerabilities
- Creation of data engineering tools to help label and validate data
Required qualifications:
- At least 8 years of experience designing and building data processing/ETL pipelines
- At least 8 years of experience in Python and Spark or similar technologies (Hive, Kafka, HBase)
- At least 8 years of experience with SQL and relational databases
- At least 8 years of experience parsing flat files
- At least 8 years of development experience
- Prior track record in a hyper-growth, high-tech company
- Bachelor's degree or equivalent practical experience
Desired qualifications:
- Experience working with Google TensorFlow
- Experience with modern technology stacks
- Experience with microservices architectures
- Experience with cloud platforms and SaaS solutions
- Experience with agile/scrum development practices
- Experience with test-driven development, continuous integration, and continuous deployment
- Experience with Git, JIRA, Confluence
- Experience with Google Compute, Firebase, and GKE
- Experience with Docker
Desired behaviors:
- Relentless restlessness to turn theory into practice and develop production-worthy code that solves real-world customer problems
- Determination to always learn and get better and never rest on one's laurels
- Personable individual who enjoys working in a team-oriented environment
- Comfort dealing with ambiguity in an environment where we build the plane as we fly it
- Ability to work within constraints and to challenge the status quo
- Ability to self-direct work and truly own the position in a hyper-growth environment
Compensation package:
- Competitive compensation
- Ownership opportunity through employee stock option plan
- Health, dental, and vision insurance
- 4% company 401(k) matching, vested immediately