✦ Luna Orbit — Data & Analytics

Data Scientist II - AMZ9774210

at Amazon.com

📍 US, NY, New York Unknown 💰 $169K – $169K USD / year Posted April 14, 2026
Salary $169K – $169K USD / year
Type Full-Time
Experience mid
Exp. Years one year of experience in the job offered or a related occupation (or five years progressive post-baccalaureate experience as equivalent to the Master's degree)
Education Master's degree in Statistics, Applied Mathematics, Economics, Engineering, Computer Science, or a related field
Category Data & Analytics

Design and implement scalable data science approaches to support or automate business decision making. Build data pipelines using SQL/ETL, develop a wide range of statistical and machine learning models, and validate results against expected outcomes and business KPIs.

  • Design and implement scalable approaches to automate decision making
  • Build SQL/ETL queries and acquire data from Oracle, RedShift, and Spark
  • Develop models using statistical, econometric, network, NLP, ML, genetic algorithms, and neural networks
  • Validate models against alternative approaches and business KPIs
  • Implement models that meet computational demands, accuracy, and reliability requirements

Works with Oracle, RedShift, and Spark storage systems using SQL and ETL to acquire and migrate data. Builds models spanning statistical, econometric, network/social network, natural language processing, machine learning, genetic algorithms, and neural networks, validating against KPIs and production constraints.

The ideal candidate is a Data Scientist II who has 1+ year of experience building statistical and machine learning models on large datasets. They can write SQL scripts for analysis and data migration, acquire data using SQL/ETL, and model using tools such as R, Python, or MATLAB while validating models against business KPIs.

building statistical models and machine learning models using large datasets from multiple resourceswriting SQL scripts for analysis and data migrationapplying specialized modelling software including RPythonor MATLABacquiring data by building the necessary SQL / ETL queriesimport processes through various company specific interfaces for accessing OracleRedShiftand Spark storage systems
SQLETLOracleRedShiftSparkRPythonMATLAB
SQLETLOracleRedShiftSparkunivariate distributionsbivariate relationshipstransformationsanomaly investigationstatistical modelingeconometric modelingnetwork modelingsocial network modelingnatural language processingmachine learning algorithmsgenetic algorithmsneural networksmodel validationkey performance indicatorsRPythonMATLABdata migration
scalable and reliable approaches to support or automate decision makingdata science techniques and toolsSQL queriesETL queriesOracleRedShiftSparkdata acquisition through company specific interfacesunivariate distributionsbivariate relationshipsdata transformationsanomaly investigationstatistical modelingmathematical modelingeconometric modelingnetwork modelingsocial network modelingnatural language processingmachine learning algorithmsgenetic algorithmsneural networksmodel validationcomputational demands evaluationaccuracy evaluationreliability evaluationkey performance indicatorsRPythonMATLABsocial network modelinggenetic algorithms
stakeholder relationship buildingcollaboration with stakeholders and counterpartscommunicationproblem solving when solution approach is unclear
Industry SaaS
Job Function Build and validate machine learning and statistical models using SQL/ETL data pipelines to automate business decisions
Role Subtype Data Scientist
Tech Domains Python, SQL / PostgreSQL, Amazon Web Services
Data Scientist IIData Scientistdecision makingautomate decision makingscalable and reliableSQLETLOracleRedShiftSparkunivariate distributionsbivariate relationshipstransformationsanomaliesstatistical modelingmathematical modelingeconometric modelingnetwork modelingsocial network modelingnatural language processingmachine learning algorithmsgenetic algorithmsneural networksmodel validationkey performance indicatorscomputational demandsaccuracyreliabilityRPythonMATLABdata migrationand Spark storage systemsImport processes

Must have 1 year of experience building statistical and machine learning models using large datasets from multiple resources, Must have 1 year of experience writing SQL scripts for analysis and data migration, Must have 1 year of experience applying specialized modeling software including R, Python, or MATLAB

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile