Job Description
1. Very strong data engineering with Python, Pyspark, and Spark and expertise in extracting data from a database, cleaning the data, and preparing it for model consumption. Feature Engineering. All of our data is in AWS, S3, Dynamo, Aurora, and RedShift. 2. Experience with Machine Learning and modeling on AWS with SageMaker. Strong knowledge of AWS services and ability to write Lambda Functions - feature extractionfeature definition, data validation, model monitoring, and model optimization. ML Libraries (scikit-learn, XGBoost, MXNet, Tensorflow, R), ML Frameworks (Airflow, MLFlow, Kubeflow) 3. Experience with large-scale production and high traffic machine learning. Experience with web services.