We are looking for an excellent Software Engineer for our client, experienced in high quality production Python code. You would work on the new ML and AI platform involving several Sprint teams around the new AI/ML product.
Job Responsibilities:
Basic Qualifications:
Job Responsibilities:
- Design, develop, and launch strategic data mining solutions, driving business-wide innovation utilizing AI and analytical techniques at scale.
- Take full ownership of the end-to-end software development lifecycle, encompassing design, testing, deployment, and operations, engage in technical discussions and strategy, and participate hands-on in design reviews, code reviews, and implementation.
- Participate in the development of highly scalable platforms for extracting, analyzing, and processing large amounts of contextual data from a plethora of sources, both in real-time and in batch modes.
- Craft high-performance, production-ready machine learning code for our next-generation real-time ML platform. Extend existing ML libraries and frameworks.
- Working closely with other engineers and scientists, develop solutions to accelerate model development, validation, and experimentation cycles, and integrate models and algorithms in production systems at a very large scale.
Basic Qualifications:
- Degree in mathematics/computer science or related discipline.
- 4+ years of experience in the complete software development lifecycle including design, coding, code reviews, testing, build processes, deployments, and operations.
- 3+ years of experience in Python.
- Required one of these 3 skills setups:
- Spark, + Py Spark + S3, + Parquet
- Spark + Py Spark + Docker + Kubernetes
- Microservices + Flask + Fast API Preferred Qualifications:
- MS or PhD in Computer Science or equivalent experience.
- Experience working with large-scale distributed systems, preferably on cloud platforms (e.g., AWS, Azure, Google Cloud).
- Experience dealing with real-world large-scale datasets.
- Strong understanding and passion for statistical/mathematical modeling and data analysis.