Senior Python Engineer - Data Engineering &Data Science Generalist - remote
Baker Engine
Posted 2 years ago
Baker Engine is a new company building an advanced sports prediction engine.
If you like sports and writing Python, this could be a really fun opportunity for you.
You can read more about the product here: https://docs.bakerengine.com/
This position is for the first employee in addition to the technical founder.
If you like sports and writing Python, this could be a really fun opportunity for you.
You can read more about the product here: https://docs.bakerengine.com/
This position is for the first employee in addition to the technical founder.
Job Description
- Write Python code that incorporates the rules and tendencies of different major American sports leagues to simulate them down to a play-by-play level.
- Identify areas in the simulation that would benefit from machine learning models.
- Use our existing all-access sports data feeds to build and train ML models in Python to account for that part of the simulation.
- Perform statistical analysis on ML models and fine-tune them using test results.
- Incorporate the ML code into our experiment and model registry system.
- Write Python DAGs to manage data syncing, ETL data processing and ML model training.
- Deploy code to production with our Github Actions that manage AWS deployments.
- Collaborate remotely with the founder on what areas of the system to work on next.
- Share on-call duty of production systems.
Requirements
- Excellent Python developer with attention to detail.
- Well-rounded data generalist with experience with a lot of the Python data ecosystem: Pandas, NumPy, SciPy &Scikit-learn.
- Familiarity with deep learning frameworks such as TensorFlow and PyTorch.
- Experience writing automated tests with pytest.
- Experience writing and maintaining DAGs for data pipelines to train and deploy machine learning models.
- Experience with the full data science lifecycle of EDA, feature engineering, model selection, training and validation.
- Experience with both regression and classification models and how to evaluate them.
- Knowledge of statistics fundamentals.
- Experience with AWS and basic systems administration in a cloud environment.
- Experience building REST APIs in Python with Flask or FastAPI.
- Experience with both SQL and NoSQL databases.
- Pragmatic about software architecture.
- Proactive &ambitious.
- Able to work independently with minimal requirements.
- Enjoys responsibility and lots of focused work time with very few Zoom calls.
- Knowledge of the NFL and at least one other major American sport.
- Excellent written and verbal communication skills.
- Optimistic team player.
- Great analytical and problem-solving skills.
- Love working with sports data and predicting the outcome of sporting events.
- Legally authorized to work in the United States of America.
- Able to work fully remotely. We have no office, so there is no risk of relocation or going “back to the office.”
- Nice to have: Advanced data science background.
- Nice to have: Detailed knowledge of many popular sports (NFL, NBA, MLB, NHL, NCAAF, NCAAB)
Compensation and Benefits
- Salary range is $130,000 - $150,000 per year.
- Unlimited PTO
- Health, vision and dental insurance
- Paid maternity &paternity leave