About this role
Design and implement scalable data science approaches to support or automate business decision making. Build data pipelines using SQL/ETL, develop a wide range of statistical and machine learning models, and validate results against expected outcomes and business KPIs.
Key Responsibilities
- Design and implement scalable approaches to automate decision making
- Build SQL/ETL queries and acquire data from Oracle, RedShift, and Spark
- Develop models using statistical, econometric, network, NLP, and ML techniques, as well as genetic algorithms and neural networks
- Validate models against alternative approaches and business KPIs
- Implement models that meet computational demands, accuracy, and reliability requirements
Technical Overview
Works with Oracle, RedShift, and Spark storage systems using SQL and ETL to acquire and migrate data. Builds models spanning statistical, econometric, network and social network, natural language processing, machine learning, genetic algorithm, and neural network techniques, validating them against KPIs and production constraints.
Ideal Candidate
The ideal candidate is a Data Scientist II with at least 1 year of experience building statistical and machine learning models on large datasets. They can write SQL scripts for analysis and data migration, acquire data using SQL/ETL, and build models with tools such as R, Python, or MATLAB, validating them against business KPIs.
Must-Have Skills
- Building statistical and machine learning models using large datasets from multiple resources
- Writing SQL scripts for analysis and data migration
- Applying specialized modeling software including R, Python, or MATLAB
- Acquiring data by building the necessary SQL/ETL queries and import processes through various company-specific interfaces for accessing Oracle, RedShift, and Spark storage systems
Tools & Platforms
SQL, ETL, Oracle, RedShift, Spark, R, Python, MATLAB
Required Skills
SQL, ETL, Oracle, RedShift, Spark, univariate distributions, bivariate relationships, transformations, anomaly investigation, statistical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, neural networks, model validation, key performance indicators, R, Python, MATLAB, data migration
Hard Skills
Scalable and reliable approaches to support or automate decision making, data science techniques and tools, SQL queries, ETL queries, Oracle, RedShift, Spark, data acquisition through company-specific interfaces, univariate distributions, bivariate relationships, data transformations, anomaly investigation, statistical modeling, mathematical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, neural networks, model validation, computational demands evaluation, accuracy evaluation, reliability evaluation, key performance indicators, R, Python, MATLAB
Soft Skills
Stakeholder relationship building, collaboration with stakeholders and counterparts, communication, problem solving when the solution approach is unclear
Keywords for Your Resume
Data Scientist II, Data Scientist, decision making, automate decision making, scalable and reliable, SQL, ETL, Oracle, RedShift, Spark, univariate distributions, bivariate relationships, transformations, anomalies, statistical modeling, mathematical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, neural networks, model validation, key performance indicators, computational demands, accuracy, reliability, R, Python, MATLAB, data migration, Spark storage systems, import processes
Deal Breakers
- Must have 1 year of experience building statistical and machine learning models using large datasets from multiple resources
- Must have 1 year of experience writing SQL scripts for analysis and data migration
- Must have 1 year of experience applying specialized modeling software including R, Python, or MATLAB