Glue semantics Jobs in Pune

2+ Glue semantics Jobs in Pune | Glue semantics Job openings in Pune

Apply to 2+ Glue semantics Jobs in Pune on CutShort.io. Explore the latest Glue semantics Job opportunities across top companies like Google, Amazon & Adobe.

ML Engineer II - Aws, Aws Cloud

Global Digital Transformation Solutions Provider

Agency job

via Peak Hire Solutions by Dhara Thakkar

Pune

6 - 12 yrs

₹25L - ₹30L / yr

Machine Learning (ML)

AWS CloudFormation

Online machine learning

Amazon Web Services (AWS)

ECS

+20 more

MUST-HAVES:

Machine Learning + Aws + (EKS OR ECS OR Kubernetes) + (Redshift AND Glue) + Sage maker
Notice period - 0 to 15 days only
Hybrid work mode- 3 days office, 2 days at home

SKILLS: AWS, AWS CLOUD, AMAZON REDSHIFT, EKS

ADDITIONAL GUIDELINES:

Interview process: - 2 Technical round + 1 Client round
3 days in office, Hybrid model.

CORE RESPONSIBILITIES:

The MLE will design, build, test, and deploy scalable machine learning systems, optimizing model accuracy and efficiency
Model Development: Algorithms and architectures span traditional statistical methods to deep learning along with employing LLMs in modern frameworks.
Data Preparation: Prepare, cleanse, and transform data for model training and evaluation.
Algorithm Implementation: Implement and optimize machine learning algorithms and statistical models.
System Integration: Integrate models into existing systems and workflows.
Model Deployment: Deploy models to production environments and monitor performance.
Collaboration: Work closely with data scientists, software engineers, and other stakeholders.
Continuous Improvement: Identify areas for improvement in model performance and systems.

SKILLS:

Programming and Software Engineering: Knowledge of software engineering best practices (version control, testing, CI/CD).
Data Engineering: Ability to handle data pipelines, data cleaning, and feature engineering. Proficiency in SQL for data manipulation + Kafka, Chaos search logs, etc. for troubleshooting; Other tech touch points are Scylla DB (like BigTable), OpenSearch, Neo4J graph
Model Deployment and Monitoring: MLOps Experience in deploying ML models to production environments.
Knowledge of model monitoring and performance evaluation.

REQUIRED EXPERIENCE:

Amazon SageMaker: Deep understanding of SageMaker's capabilities for building, training, and deploying ML models; understanding of the Sage maker pipeline with ability to analyze gaps and recommend/implement improvements
AWS Cloud Infrastructure: Familiarity with S3, EC2, Lambda and using these services in ML workflows
AWS data: Redshift, Glue
Containerization and Orchestration: Understanding of Docker and Kubernetes, and their implementation within AWS (EKS, ECS)

MUST-HAVES:

Machine Learning + Aws + (EKS OR ECS OR Kubernetes) + (Redshift AND Glue) + Sage maker
Notice period - 0 to 15 days only
Hybrid work mode- 3 days office, 2 days at home

SKILLS: AWS, AWS CLOUD, AMAZON REDSHIFT, EKS

ADDITIONAL GUIDELINES:

Interview process: - 2 Technical round + 1 Client round
3 days in office, Hybrid model.

CORE RESPONSIBILITIES:

The MLE will design, build, test, and deploy scalable machine learning systems, optimizing model accuracy and efficiency
Model Development: Algorithms and architectures span traditional statistical methods to deep learning along with employing LLMs in modern frameworks.
Data Preparation: Prepare, cleanse, and transform data for model training and evaluation.
Algorithm Implementation: Implement and optimize machine learning algorithms and statistical models.
System Integration: Integrate models into existing systems and workflows.
Model Deployment: Deploy models to production environments and monitor performance.
Collaboration: Work closely with data scientists, software engineers, and other stakeholders.
Continuous Improvement: Identify areas for improvement in model performance and systems.

SKILLS:

Programming and Software Engineering: Knowledge of software engineering best practices (version control, testing, CI/CD).
Data Engineering: Ability to handle data pipelines, data cleaning, and feature engineering. Proficiency in SQL for data manipulation + Kafka, Chaos search logs, etc. for troubleshooting; Other tech touch points are Scylla DB (like BigTable), OpenSearch, Neo4J graph
Model Deployment and Monitoring: MLOps Experience in deploying ML models to production environments.
Knowledge of model monitoring and performance evaluation.

REQUIRED EXPERIENCE:

Amazon SageMaker: Deep understanding of SageMaker's capabilities for building, training, and deploying ML models; understanding of the Sage maker pipeline with ability to analyze gaps and recommend/implement improvements
AWS Cloud Infrastructure: Familiarity with S3, EC2, Lambda and using these services in ML workflows
AWS data: Redshift, Glue
Containerization and Orchestration: Understanding of Docker and Kubernetes, and their implementation within AWS (EKS, ECS)

AWS Data Engineer

at Deqode

1 recruiter

Posted by Alisha Das

Pune, Mumbai, Bengaluru (Bangalore), Chennai

4 - 7 yrs

₹5L - ₹15L / yr

Amazon Web Services (AWS)

Python

PySpark

Glue semantics

Amazon Redshift

+1 more

Job Overview:

We are seeking an experienced AWS Data Engineer to join our growing data team. The ideal candidate will have hands-on experience with AWS Glue, Redshift, PySpark, and other AWS services to build robust, scalable data pipelines. This role is perfect for someone passionate about data engineering, automation, and cloud-native development.

Key Responsibilities:

Design, build, and maintain scalable and efficient ETL pipelines using AWS Glue, PySpark, and related tools.
Integrate data from diverse sources and ensure its quality, consistency, and reliability.
Work with large datasets in structured and semi-structured formats across cloud-based data lakes and warehouses.
Optimize and maintain data infrastructure, including Amazon Redshift, for high performance.
Collaborate with data analysts, data scientists, and product teams to understand data requirements and deliver solutions.
Automate data validation, transformation, and loading processes to support real-time and batch data processing.
Monitor and troubleshoot data pipeline issues and ensure smooth operations in production environments.

Required Skills:

5 to 7 years of hands-on experience in data engineering roles.
Strong proficiency in Python and PySpark for data transformation and scripting.
Deep understanding and practical experience with AWS Glue, AWS Redshift, S3, and other AWS data services.
Solid understanding of SQL and database optimization techniques.
Experience working with large-scale data pipelines and high-volume data environments.
Good knowledge of data modeling, warehousing, and performance tuning.

Preferred/Good to Have:

Experience with workflow orchestration tools like Airflow or Step Functions.
Familiarity with CI/CD for data pipelines.
Knowledge of data governance and security best practices on AWS.

Job Overview:

Key Responsibilities:

Design, build, and maintain scalable and efficient ETL pipelines using AWS Glue, PySpark, and related tools.
Integrate data from diverse sources and ensure its quality, consistency, and reliability.
Work with large datasets in structured and semi-structured formats across cloud-based data lakes and warehouses.
Optimize and maintain data infrastructure, including Amazon Redshift, for high performance.
Collaborate with data analysts, data scientists, and product teams to understand data requirements and deliver solutions.
Automate data validation, transformation, and loading processes to support real-time and batch data processing.
Monitor and troubleshoot data pipeline issues and ensure smooth operations in production environments.

Required Skills:

5 to 7 years of hands-on experience in data engineering roles.
Strong proficiency in Python and PySpark for data transformation and scripting.
Deep understanding and practical experience with AWS Glue, AWS Redshift, S3, and other AWS data services.
Solid understanding of SQL and database optimization techniques.
Experience working with large-scale data pipelines and high-volume data environments.
Good knowledge of data modeling, warehousing, and performance tuning.

Preferred/Good to Have:

Experience with workflow orchestration tools like Airflow or Step Functions.
Familiarity with CI/CD for data pipelines.
Knowledge of data governance and security best practices on AWS.

Get to hear about interesting companies hiring right now

Follow Cutshort

Why apply via Cutshort?

Connect with actual hiring teams and get their fast response. No spam.

Find more jobs

Get to hear about interesting companies hiring right now

Follow Cutshort