15+ Monitoring Jobs in Bangalore (Bengaluru) | Monitoring Job openings in Bangalore (Bengaluru)
Apply to 15+ Monitoring Jobs in Bangalore (Bengaluru) on CutShort.io. Explore the latest Monitoring Job opportunities across top companies like Google, Amazon & Adobe.
- Good experience with Batch operator in AutoSys.
- Some experience in Job Scripting will be plus.
- Good experience in Monitoring tools and troubleshooting.
- Excellent experience in Job sequencing and prioritization.
- Should have very good experience in handling escalations.
- Should have experience in Change Management/Control/Change Request (CR).
Certifications : Any certifications related to AutoSys will be a plus.
Job Title - Sr. Administrator, IT Infrastructure Storage
Job Duties -
- Administer IT storage recovery and backup systems. Perform complex provisioning, advanced maintenance, data replication, disaster recovery, data migration, and documentation.
- Participate in ongoing maintenance, utilization, availability, and security of storage infrastructure.
- Perform IT implementations, performance analysis and optimization, monitoring, problem resolution, upgrade planning and execution, and process creation and documentation.
- Analyze and work to improve the quality of services offered by IT. Participate in ongoing technology evaluations to keep up with technology trends and industry standards.
- Be able to script in Perl or Python and perform capacity planning and growth projections.
- Resolve complex IT issues as pertain to the environment and keep abreast on storage & backup technology.
- Experience working with ISILON Storage is a must-have skill.
- Good to have experience in ransomware, anomaly detection, airgap and vaulting technologies.
Education and Experience Requirements -
The position requires a Bachelor’s degree in
Computer Science, Computer Engineering or related field plus 5 years of post-
baccalaureate progressive experience in IT storage environments.
Skills Requirements- Experience must include:
- Expertise in Linux and Windows – Good to have Cloud knowledge
- Utilizing OS-enabled tools for data copy like rsync, robocopy and other tools
- Setting up NFS, SMB, snapshots, replication, SnapMirror, and SyncIQ
- Experience in NAS storage technologies like Netapp, Power scale, Isilon, Qumulo and Weka
- Dell EMC Insight IQ, DataIQ and ESRS, Netapp tools (OCUM/AIUM)
- Rubrik, Cohesity, Veeam and other comparable backup technologies
- Good to have Monitoring and Alerting like Prometheus, SolarWinds, telegraph and Grafana
Job Summary
Cloud Production Support Engineer(PSE) is responsible for fulfilling the day-to-day infrastructure and service requests from the application teams across AWS, CI/CD solutions and observability tools. You will be expected to handle production issues in collaboration with the cloud Infrastructure and application teams.
Responsibilities and Duties
- Troubleshoot production Issues: When technical issues with the cloud infrastructure components arise, PSE must act quickly to analyse the available data and find the root cause of the problem. They may then develop a solution or escalate the problem to other engineering team members while providing stakeholders with progress updates.
- Infrastructure provisioning and modification: Application teams may request to create new infrastructure or modify the existing ones in AWS based on their requirements via the ticketing tool. PSE should ensure that the required data/info is available on the ticket and provide a resolution based on the given SLA.
- Alert Management: Alerts from the observability tools will be received on multiple channels according to the notification settings. PSEs are expected to acknowledge the alerts, troubleshoot the issue, close the alert based on the given SLA, or escalate to the cloud infra/DevOps team for further diagnosis.
- Onboarding, Off-boarding and access management: Whenever an employee joins or leaves the organization, you will receive an onboarding or offboarding request.
- Prepare Technical Documentation: PSEs must prepare documentation when logging product issues, as they must note all details, including their observations, diagnoses, and action steps. Other everyday tasks include weekly reports summarising production performance, upgrade release notes, and troubleshooting guides.
- Product Improvements: Since PSEs have good exposure to the product issues, they should work closely with the PMs+EMs, pass the feedback on the product, and get the improvements/fixes included in the product roadmap.
- Adherence to SLA and timelines: PSEs should always adhere to the timelines shared with other teams for closure of fixes and deliver outcomes as per the SLA guidance agreed with business teams
- Reporting: Report & track weekly regarding SLA metrics, tickets being worked and closed by PSEs/transferred tickets. Identify and devise how productivity can be captured at the individual level and report the same monthly.
Qualifications and Skills
- Degree in Computer Science/Information Technology.
- Two years or more experience in Cloud and system administration.
- Experience troubleshooting in complex environments using monitoring tools.
- Demonstrated experience with containerisation technologies (Docker, Kubernetes, etc.)
- Hands-on experience with the most common AWS services.
Job Description:
• Drive end-to-end automation from GitHub/GitLab/BitBucket to Deployment,
Observability and Enabling the SRE activities
• Guide operations support (setup, configuration, management, troubleshooting) of
digital platforms and applications
• Solid understanding of DevSecOps Workflows that support CI, CS, CD, CM, CT.
• Deploy, configure, and manage SaaS and PaaS cloud platform and applications
• Provide Level 1 (OS, patching) and Level 2 (app server instance troubleshooting)
• DevOps programming: writing scripts, building operations/server instance/app/DB
monitoring tools Set up / manage continuous build and dev project management
environment: JenkinX/GitHub Actions/Tekton, Git, Jira Designing secure networks,
systems, and application architectures
• Collaborating with cross-functional teams to ensure secure product development
• Disaster recovery, network forensics analysis, and pen-testing solutions
• Planning, researching, and developing security policies, standards, and procedures
• Awareness training of the workforce on information security standards, policies, and
best practices
• Installation and use of firewalls, data encryption and other security products and
procedures
• Maturity in understanding compliance, policy and cloud governance and ability to
identify and execute automation.
• At Wesco, we discuss more about solutions than problems. We celebrate innovation
and creativity.
This role is for Work from the office.
Job Description
Roles & Responsibilities
- Work across the entire landscape that spans network, compute, storage, databases, applications, and business domain
- Use the Big Data and AI-driven features of vuSmartMaps to provide solutions that will enable customers to improve the end-user experience for their applications
- Create detailed designs, solutions and validate with internal engineering and customer teams, and establish a good network of relationships with customers and experts
- Understand the application architecture and transaction-level workflow to identify touchpoints and metrics to be monitored and analyzed
- Analytics and analysis of data and provide insights and recommendations
- Constantly stay ahead in communicating with customers. Manage planning and execution of platform implementation at customer sites.
- Work with the product team in developing new features, identifying solution gaps, etc.
- Interest and aptitude in learning new technologies - Big Data, no SQL databases, Elastic Search, Mongo DB, DevOps.
Skills & Experience
- At least 2+ years of experience in IT Infrastructure Management
- Experience in working with large-scale IT infra, including applications, databases, and networks.
- Experience in working with monitoring tools, automation tools
- Hands-on experience in Linux and scripting.
- Knowledge/Experience in the following technologies will be an added plus: ElasticSearch, Kafka, Docker Containers, MongoDB, Big Data, SQL databases, ELK stack, REST APIs, web services, and JMX.
As a DevOps Engineer with experience in Kubernetes, you will be responsible for leading and managing a team of DevOps engineers in the design, implementation, and maintenance of the organization's infrastructure. You will work closely with software developers, system administrators, and other IT professionals to ensure that the organization's systems are efficient, reliable, and scalable.
Specific responsibilities will include:
- Leading the team in the development and implementation of automation and continuous delivery pipelines using tools such as Jenkins, Terraform, and Ansible.
- Managing the organization's infrastructure using Kubernetes, including deployment, scaling, and monitoring of applications.
- Ensuring that the organization's systems are secure and compliant with industry standards.
- Collaborating with software developers to design and implement infrastructure as code.
- Providing mentorship and technical guidance to team members.
- Troubleshooting and resolving technical issues in collaboration with other IT professionals.
- Participating in the development and maintenance of the organization's disaster recovery and incident response plans.
To be successful in this role, you should have strong leadership skills and experience with a variety of DevOps and infrastructure tools and technologies. You should also have excellent communication and problem-solving skills, and be able to work effectively in a fast-paced, dynamic environment.
At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.
Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.
F5 is looking for a Sr. Security Engineer with experience in building, integrating, operating, and maintaining robust security monitoring and auditing systems. F5’s Edge 2.0 platform provides global, scalable, and secure way to deploy applications. In this position, you will build and maintain monitoring and audit systems across the platform that provide necessary visibility and alerts to effectively defend the platform.
Responsibilities:
- Collaborate with software architects, security defenders, Operations, SRE, compliance experts, and business leaders to understand the logical boundaries of the systems and identify the events to monitor, audits to maintain, alerts to tweak, as well as systems to integrate with
- You will continuously hunt for areas and metrics to be added into monitoring systems for better operational visibility, incident response capability, availability, and forensics capability of the overall platform
- You will participate in the definition of processes around change and inventory management and develop solutions to audit the changes
- You will work with other teams within security organization to define communication and alerting protocols for effective and timely actions
- You will participate in defining and executing the Incident Response Plan for the platform and be responsible for providing necessary information during the response and forensics
- Demonstrate technical leadership in multiple domain areas, providing mentorship to other team members
Minimum qualifications:
- BS degree in Computer Science or equivalent with 5+ years of security operation and monitoring experience
- Experience with logging, monitoring, SIEM, dashboarding tools like AWS GuardDuty, Sumo, Grafana, SolarWinds, DataDog, Splunk, etc.
- Working knowledge of at least one Cloud Computing platform (e.g. Amazon AWS, Microsoft Azure, Google Compute etc.)
- Good understanding of how to handle logs from various systems, integrate with systems handling logs and metrics, how to setup and tune alerts based on thresholds and policies
- Hands on experience with computer programming languages and/or scripting languages such as Python, Java, Shell
- Good understanding of complexities and security challenges in large-scale distributed systems
- Working knowledge of Cloud orchestration systems such as Kubernetes, Openstack etc.
- Self-motivated and willing to delve into new areas and take on new challenges in an enthusiastic manner
- Excellent written and verbal communication skills
- Strong interpersonal, team building, and mentoring skills
at BELTECH ARTIFICIAL INTELLIGENCE PRIVATE LIMITED
The DevOps Engineer's core responsibilities include automated configuration and management
of infrastructure, continuous integration and delivery of distributed systems at scale in a Hybrid
environment.
Must-Have:
● You have 4-10 years of experience in DevOps
● You have experience in managing IT infrastructure at scale
● You have experience in automation of deployment of distributed systems and in
infrastructure provisioning at scale.
● You have in-depth hands-on experience on Linux and Linux-based systems, Linux
scripting
● You have experience in Server hardware, Networking, firewalls
● You have experience in source code management, configuration management,
continuous integration, continuous testing, continuous monitoring
● You have experience with CI/CD and related tools
* You have experience with Monitoring tools like ELK, Grafana, Prometheus
● You have experience with containerization, container orchestration, management
● Have a penchant for solving complex and interesting problems.
● Worked in startup-like environments with high levels of ownership and commitment.
● BTech, MTech or Ph.D. in Computer Science or related Technical Discipline
This position is for Oracle Golden Gate developer and DBA.
Job Description.
DB developer
This position is for Oracle Golden Gate developer and DBA position. The Candidate will implement and support Oracle Golden Gate replication components across many databases. Working closely with many application teams and implementing near real time replications with golden gate. Candidate should provide operational support as well.
Roles and responsibilities :
Configure and build Golden Gate Extracts/replicate for multiple databases.
Configure error handling for restart, logging, production support and status reporting in Golden Gate.
Identify and produce documentation of best practices.
Trouble shooting and performance tuning Golden Gate replications.
Implementing necessary monitoring scripts for Golden Gate in UNIX/PERL scripting
Provide Golden Gate solutions :
- Working with Oracle on SRs for critical issues.
- Strong oral and written communication skills along with problem solving skills.
Individual with min experience of over 5+ years on SQL Server
- Good Experience in SQL Server Installation and Configuration
- Backup and Recovery
- Security management
- Troubleshooting & Monitoring
- SSIS/SSRS/SSAS
at Alke Research private limited
At our organisation, the individual can expect to grow in their career to VP position and establish off shore units in 1 - 2 years
Database Administrator Lead - PostgreSQL
ABOUT Ashnik
Established in 2009, Ashnik is a leading open-source solutions and consulting company in South East Asia and India, headquartered in Singapore. We enable digital transformation for large enterprises through our design, architecting, and solution skills. Over 100 large enterprises in the region have acknowledged our expertise in delivering solutions using key open-source technologies. Our offerings form critical part of Digital transformation, Big Data platform, Cloud and Web acceleration and IT modernization. We represent EDB, Pentaho, Docker, Couchbase, MongoDB, Elastic, NGINX, Sysdig, Redis Labs, Confluent, and HashiCorp as their key partners in the region. Our team members bring decades of experience in delivering confidence to enterprises in adopting open source software and are known for their thought leadership.
THE POSITION
Ashnik is looking for talented and passionate people to be part of the team for an upcoming project at client location.
RESPONSIBILITIES
· Monitoring database performance
· Optimizing Queries and handle escalations
· Analyse and assess the impact and risk of low to medium risk changes on high profile production databases
· Implement security features
· DR implementation and switch over
QUALIFICATION AND EXPERIENCE
· Preferably have a working experience of 4 Years and more , on production PostgreSQL DBs.
· Experience of working in a production support environment
· Engineering or Equivalent degree
· Passion for open-source technologies is desired
ADDITIONAL SKILLS
· Install & Configure PostgreSQL, Enterprise DB
· Technical capabilities PostgreSQL 9.x, 10.x, 11.x
· Server tuning
· Troubleshooting of Database issues
· Linux Shell Scripting
· Install, Configure and maintain Fail Over mechanism
· Backup - Restoration, Point in time database recovery
· A demonstrable ability to articulate and sell the benefits of modern platforms, software and technologies.
· A real passion for being curious and a continuous learner. You are someone that invests in yourself as much as you invest in your professional relationships.
LOCATION: Bangalore & Mumbai
Experience: 7 yrs plus
Package: upto 20 LPA
Job Description
Please connect me on Linkedin or share your Resume on shrashti jain
• 8+ years of overall experience and relevant of at least 4+ years. (Devops experience has be more when compared to the overall experience)
• Experience with Kubernetes and other container management solutions
• Should have hands on and good understanding on DevOps tools and automation framework
• Demonstrated hands-on experience with DevOps techniques building continuous integration solutions using Jenkins, Docker, Git, Maven
• Experience with n-tier web application development and experience in J2EE / .Net based frameworks
• Look for ways to improve: Security, Reliability, Diagnostics, and costs
• Knowledge of security, networking, DNS, firewalls, WAF etc
• Familiarity with Helm, Terraform for provisioning GKE,Bash/shell scripting
• Must be proficient in one or more scripting languages: Unix Shell, Perl, Python
• Knowledge and experience with Linux OS
• Should have working experience with monitoring tools like DataDog, Elk, and/or SPLUNK, or any other monitoring tools/processes
• Experience working in Agile environments
• Ability to handle multiple competing priorities in a fast-paced environment
• Strong Automation and Problem-solving skills and ability
• Experience of implementing and supporting AWS based instances and services (e.g. EC2, S3, EBS, ELB, RDS, IAM, Route53, Cloudfront, Elasticache).
•Very strong hands with Automation tools such Terraform
• Good experience with provisioning tools such as Ansible, Chef
• Experience with CI CD tools such as Jenkins.
•Experience managing production.
• Good understanding of security in IT and the cloud
• Good knowledge of TCP/IP
• Good Experience with Linux, networking and generic system operations tools
• Experience with Clojure and/or the JVM
• Understanding of security concepts
• Familiarity with blockchain technology, in particular Tendermint
IT solutions specialized in Apps Lifecycle management. (MG1)
- Install, Configuration, and Tuning of the following AppDynamics Servers: Controller, Event Service Cluster, End User Monitoring, ADA, ADRUM
- Reviews system design and works to continuously improve stability and efficiencies
- Provides system backup recovery methodology and makes recommendations regarding enhancements and/or improvements
- Formulates policies, procedures, and standards relating to system management, and monitors system resource utilization
- Responsible for reducing operational downtime for critical, scheduled, and unscheduled maintenance by accelerating deployments of approved changes/fixes/updates and solutions and automate manual maintenance, deployment, diagnostic health checks, validation, and reporting
- Responsible for creating proactive and reactive monitoring methods, generating customer alerts within the Enterprise Event Management and Monitoring capability
- Skilled at user requirement gathering and can work independently to craft efficient monitoring, alarming solutions, and dashboards
- Understands the Agile process
- Build dashboards in Grafana.
- Setup independently Prometheus, Node exporters, etc
- Comfortable with PromQL.
- Ability to operationally support the underlying database as necessary
- Hands-on Java and/or .Net Development
- IT Operations and Application Support
- Application and systems performance management, measurement, and analysis.
- Deployment and configuration of complex enterprise software
- Solid understanding of Operating Systems (Linux/Windows)
- Strong understanding of built-in O/S monitoring and performance tools.
- Working with a wide variety of platforms and application stacks.
- Ability to understand new application frameworks in customer environments quickly
- Works with minimal direction as a seasoned resource
- Support customer initiatives in their transition towards modernization
- Tracks own work and backlog, familiar with Agile methodology
- Prioritize own work in accordance with user priorities and stakeholder expectations
- Communicates efficiently and effectively both written and verbal
- Reviews system design and works to continuously improve stability and efficiencies
- AppDynamics
- Java
- Linux/Windows
- Fault and Performance Monitoring Tools Administration
- Machine/Dotnet/Java agent
- Grafana, ELK
- Prometheus.
Disruptive Fintech Startup
- Monitoring collections once set up at a high frequency and ensuring reconciliations happen on the track.
- Creating and maintaining live collection dashboards for the senior management.
- Creating weekly reports for our lending partners.
- Monitoring the portfolio and escalating as necessary if collections performance is flagging based on identified indicators.
- Ensuring TAT on collections while improving the collections experience for all stakeholders.
What you need to have:
- Postgraduation required
- 2-4 yrs of experience in operations or collections in the NBFC/ fin-tech industry (mandatory).
- Experience in taking ownership and handling collections end to end (mandatory).
- Excellent verbal and written English communication (mandatory)
- Strong understanding of MS Excel (mandatory)
- Accounting/ Finance degree (bonus)
- Experience in creating reports and analytics dashboards (bonus)