ETRM Data Scientist

Or refer someone

Job Openings ETRM Data Scientist

About the job ETRM Data Scientist

Job Description:

Education Requirements:

o Masters degree in mathematics, Statistics, Data Science, or related fields is

mandatory.

o A Ph.D. in Mathematics, Statistics, Data Science, or similar areas is preferred but

not mandatory.

Mandatory skills:

o Data Science:

Extensive experience in time-series forecasting, predictive modelling, and

deep learning.

Proficient in designing reusable and scalable machine learning systems.

Proficiency in implementing techniques such as ARIMA, LSTM, Prophet,

Linear Regression, and Random Forest to ensure accurate forecasting

and insights.

Strong command of machine learning libraries, including scikit-learn,

XGBoost, Darts, TensorFlow, and PyTorch, along with data manipulation

tools like Pandas and NumPy.

Proven expertise in designing and implementing explicit ensemble

techniques such as stacking, boosting and bagging to improve model

accuracy and robustness.

Proven track record of analysing and optimizing performance of

operational machine learning models to ensure long-term efficiency andreliability.

Expertise in retraining and fine-tuning models based on evolving data

trends and business requirements.

o MLOps Implementation:

Proficiency in leveraging Python-based MLOps frameworks for

automating machine learning pipelines, including model deployment,

monitoring, and periodic retraining.

Advanced experience in using the Azure Machine Learning Python SDK

to design and implement parallel model training workflows, incorporating

distributed computing, parallel job execution, and efficient handling of

large-scale datasets in managed cloud environments.

o PySpark Proficiency

Strong experience in PySpark for scalable data processing and analytics.

o Azure Expertise:

Azure Machine Learning: Managing parallel model training, deployment,

and operationalization using the Python SDK.

Azure Databricks: Collaborating on data engineering and analytics tasks

using PySpark/Python.

Azure Data Lake: Implementing scalable storage and processing

solutions for large datasets.

Preferred skills:

o K-Means Clustering: Experience in applying k-means clustering for data

segmentation and pattern identification.

o Bottom-Up Forecasting: Skilled in creating granular bottom-up forecasting

models for hierarchical insights.

o Azure Data Factory : Designing, orchestrating, and managing pipelines for

seamless data integration and processing.

o knowledge of power trading concepts.

o Generative AI (GenAI): Experience in applying generative AI models, such as

GPT or similar frameworks.

Or refer someone