Position Details

Salary $169K USD / year

Type Full-Time

Experience mid

Exp. Years one year of experience in the job offered or a related occupation (or five years progressive post-baccalaureate experience as equivalent to the Master's degree)

Education Master's degree in Statistics, Applied Mathematics, Economics, Engineering, Computer Science, or a related field

Category Data & Analytics

About this role

Design and implement scalable data science approaches to support or automate business decision making. Build data pipelines using SQL/ETL, develop a wide range of statistical and machine learning models, and validate results against expected outcomes and business KPIs.

Key Responsibilities

Design and implement scalable approaches to automate decision making
Build SQL/ETL queries and acquire data from Oracle, RedShift, and Spark
Develop models using statistical, econometric, network, NLP, ML, genetic algorithms, and neural networks
Validate models against alternative approaches and business KPIs
Implement models that meet computational demands, accuracy, and reliability requirements

Technical Overview

Works with Oracle, RedShift, and Spark storage systems using SQL and ETL to acquire and migrate data. Builds models spanning statistical, econometric, network/social network, natural language processing, machine learning, genetic algorithms, and neural networks, validating against KPIs and production constraints.

Ideal Candidate

The ideal candidate is a Data Scientist II with at least one year of experience building statistical and machine learning models on large datasets. They can write SQL scripts for analysis and data migration, acquire data using SQL/ETL, and build models with tools such as R, Python, or MATLAB, validating them against business KPIs.

Must-Have Skills

building statistical models and machine learning models using large datasets from multiple resources
writing SQL scripts for analysis and data migration
applying specialized modeling software including R, Python, or MATLAB
acquiring data by building the necessary SQL / ETL queries
import processes through various company-specific interfaces for accessing Oracle, RedShift, and Spark storage systems

Tools & Platforms

SQL, ETL, Oracle, RedShift, Spark, R, Python, MATLAB

Required Skills

SQL, ETL, Oracle, RedShift, Spark, univariate distributions, bivariate relationships, transformations, anomaly investigation, statistical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, neural networks, model validation, key performance indicators, R, Python, MATLAB, data migration

Hard Skills

scalable and reliable approaches to support or automate decision making, data science techniques and tools, SQL queries, ETL queries, Oracle, RedShift, Spark, data acquisition through company-specific interfaces, univariate distributions, bivariate relationships, data transformations, anomaly investigation, statistical modeling, mathematical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, neural networks, model validation, computational demands evaluation, accuracy evaluation, reliability evaluation, key performance indicators, R, Python, MATLAB

Soft Skills

stakeholder relationship building, collaboration with stakeholders and counterparts, communication, problem solving when solution approach is unclear

Industry & Role

Industry SaaS

Job Function Build and validate machine learning and statistical models using SQL/ETL data pipelines to automate business decisions

Role Subtype Data Scientist

Tech Domains Python, SQL / PostgreSQL, Amazon Web Services

Keywords for Your Resume

Data Scientist II, Data Scientist, decision making, automate decision making, scalable and reliable, SQL, ETL, Oracle, RedShift, Spark, univariate distributions, bivariate relationships, transformations, anomalies, statistical modeling, mathematical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, neural networks, model validation, key performance indicators, computational demands, accuracy, reliability, R, Python, MATLAB, data migration, Spark storage systems, import processes

Deal Breakers

Must have 1 year of experience building statistical and machine learning models using large datasets from multiple resources
Must have 1 year of experience writing SQL scripts for analysis and data migration
Must have 1 year of experience applying specialized modeling software including R, Python, or MATLAB

Data Scientist II - AMZ9774210