
Senior Infrastructure Engineer
Activeloop is an autodata company. We connect raw data to machine learning models, seamlessly. We empower data scientists to focus on training ML models instead of wrangling data. We enable organizations to unlock the true potential of unstructured data, faster and cheaper.
The company was founded by PhDs from Princeton University and is backed by Y Combinator and other prominent Silicon Valley investors.
Skills: Kubernetes, Puppet, Google Cloud, Unix, Distributed Systems, Docker, Jenkins, Serverless, Elastic Stack (ELK), Amazon Web Services (AWS)
We are looking for our first senior infrastructure engineer to build the managed version of Activeloop Hub (https://github.com/activeloopai/hub), an open-source unstructured dataset management tool for large-scale machine learning workloads. Hub has been trending #2 across all open-source software and #1 in Python. It has been growing 77% MoM. We are building Data 2.0 for Software 2.0.
Requirements
- [Preferred] Experience building distributed systems, including an understanding of their edge cases, failure modes, behaviors, and implementation tradeoffs.
- [Preferred] Extensive experience with cloud systems (AWS/GCP/Azure), including knowledge of Docker, Kubernetes, CI/CD tools (such as CircleCI or similar), and infrastructure-as-code tools (Terraform/CloudFormation).
- Ability to pick the best tool for the job and integrate an array of technologies into a reliable High Performance Computing solution.
- [bonus] Experience with data streaming and storage/file systems (object stores, NFS).
- [bonus] Strong programming and scripting skills in Python, Ruby, and/or Go.
We also expect you to:
- Grow into a team lead role and eventually be responsible for all technical aspects of the SRE function.
- Join a team of highly motivated, curious, hardworking explorers in the field of AI.
- Have a builder attitude - you love building cool things that matter!
- Work closely with the founding team in developing hyper-scalable software for ML.
- Proactively identify and anticipate problems and provide tangible solutions.
- Enjoy, and be ready for, the startup journey toward building an enduring, scalable business.
We are building Data 2.0: https://github.com/activeloopai/Hub
The landscape of compute resources across specialized hardware and cloud providers is becoming increasingly fragmented.
We're building a platform that unifies and abstracts away infrastructure for easier and highly efficient machine learning and deep learning.