11+ System monitoring Jobs in Mumbai | System monitoring Job openings in Mumbai
Apply to 11+ System monitoring Jobs in Mumbai on CutShort.io. Explore the latest System monitoring Job opportunities across top companies like Google, Amazon & Adobe.
This is a work-from-office role.
Job Description
Roles & Responsibilities
- Work across the entire landscape that spans network, compute, storage, databases, applications, and business domain
- Use the Big Data and AI-driven features of vuSmartMaps to provide solutions that will enable customers to improve the end-user experience for their applications
- Create detailed designs, solutions and validate with internal engineering and customer teams, and establish a good network of relationships with customers and experts
- Understand the application architecture and transaction-level workflow to identify touchpoints and metrics to be monitored and analyzed
- Analyze data and provide insights and recommendations
- Constantly stay ahead in communicating with customers. Manage planning and execution of platform implementation at customer sites.
- Work with the product team in developing new features, identifying solution gaps, etc.
- Interest and aptitude in learning new technologies: Big Data, NoSQL databases, Elasticsearch, MongoDB, DevOps.
Skills & Experience
- At least 2 years of experience in IT Infrastructure Management
- Experience in working with large-scale IT infra, including applications, databases, and networks.
- Experience in working with monitoring tools, automation tools
- Hands-on experience in Linux and scripting.
- Knowledge/Experience in the following technologies will be an added plus: ElasticSearch, Kafka, Docker Containers, MongoDB, Big Data, SQL databases, ELK stack, REST APIs, web services, and JMX.
Job Responsibilities:
- Server Monitoring: Monitor server performance and respond to alerts promptly. Troubleshoot and resolve system and network issues.
- Server Disk Cleanup: Regularly analyze and clean up server storage to optimize performance and resource allocation.
- Deployment of New Linux Instances: Create and configure new Linux server instances based on project requirements. Ensure proper security measures and updates during deployment.
- DNS Management: Manage DNS records, domains, and configurations for internal and external services.
- Backing up/Archiving Logs: Implement backup and archiving strategies for server logs to maintain data integrity and facilitate auditing.
- MySQL Backups: Create and manage automated backups of MySQL databases to ensure data integrity. Develop disaster recovery plans.
- System Administration Strategies: Develop and implement system administration strategies to enhance server performance, security, and scalability. Stay updated with industry best practices and emerging technologies
Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent work experience).
- Proven experience as a Linux Administrator or similar role (5+ years).
- Proficiency in Linux server administration (CentOS, Ubuntu, etc.).
- Strong knowledge of:
  - Server monitoring tools (e.g., Nagios, Zabbix).
  - Configuration management tools (e.g., Ansible, Puppet).
  - Virtualization and containerization (e.g., Docker, Kubernetes).
  - DNS management (e.g., BIND, AWS Route 53).
  - Backup solutions (e.g., Bacula, AWS Backup).
  - Log management and analysis (e.g., ELK Stack, Graylog).
  - Database administration (e.g., MySQL, MariaDB).
  - Security best practices and firewall configuration.
  - Scripting languages (e.g., Bash, Python).
  - Version control systems (e.g., Git).
  - Cloud platforms (e.g., AWS, Azure, Google Cloud).
  - Monitoring and alerting tools (e.g., Grafana, Prometheus).
  - Networking concepts (TCP/IP, routing).
  - System administration strategies and documentation.
- Excellent problem-solving skills and the ability to work independently.
- Strong communication and teamwork skills.
Main tasks
- Implementation, operation, and monitoring of LS system solutions on-premise and in the cloud
- Analysis and sustainable troubleshooting of system failures
- Support in the area of information and IT security
Qualification profile
- Successfully completed training or studies in the field of IT
- Experience in operating on-premise, public and private cloud solutions
- Young professionals are welcome
- Good English skills
- Knowledge in the following areas:
  - Oracle Database
  - Oracle Application Server
  - Oracle Linux
  - Oracle Linux KVM
  - Oracle WebLogic
- Experience in installing, configuring, maintaining, and lifecycle management of Linux systems
- Experience in building and maintaining file systems and storage volumes, including LVM concepts, LUNs, swap, etc.
- Expert hands-on knowledge of Bash/shell or Python scripting
- Build and maintain a container orchestration platform (Docker) using Kubernetes, OpenShift, MicroK8s, or similar
- Develop and maintain automated processes, tools, and documentation in support of the Docker and Kubernetes container orchestration platform
- Ability to perform automated infrastructure code testing, integration, deployment, and assurance using DevOps and CI/CD methodologies (e.g., Jenkins, GitLab) is a plus
- Automation/configuration management with Ansible or Puppet and scripting
- Strong background in Linux networking (ip, iptables, ipsec)
- Strong knowledge of configuring diverse subsystems (systemd, printers, graphic adapters, networking, SELinux, firewalls, etc.)
- Strong knowledge of storage concepts: NAS, NFS, SAN, RAID, ZFS
- Knowledge of creating and maintaining repositories (intranet and internet)
- Hands-on with installation procedures: yum, Kickstart, Anaconda, Plymouth
- UEFI/legacy BIOS: how to boot, how to make a medium bootable, handling of keys, etc.
- Strong knowledge of a wide variety of open source technologies/tools and cloud services
- Strong knowledge of best practices and IT operations in an always-up, always-available service
- Experience in server virtualisation technology, mainly VMware vSphere, vCenter, vSAN, etc.; Proxmox is a plus
- Experience in setting up DNS (BIND) and LDAP directory services is a plus
- Experience in using Atlassian toolchains (Jira and Confluence) is a plus
- Knowledge of Ceph storage technology is a plus
Roles & Responsibilities
- Design, develop, implement and maintain our core Oracle applications.
- Maintain our SQL / PLSQL processes.
- Ensure DB availability.
- Proactively manage and maintain security standards and controls by creating storage database structures with high-level security features.
- Actively seek to optimise and simplify our architecture.
- Take ownership of performance and capacity monitoring aspects of the DB.
- Execution of data migration jobs and scripts as required.
- Assist the infrastructure team in sizing hardware used for the DB.
- Support and collaborate with product developers.
- Contribute to the creation and maintenance of disaster recovery plans.
- Work closely with the application vendors, co-ordinate and manage the DBA activities.
- Own and follow up with Vendors/Stakeholders & their DBAs when required.
- Create a reliable backup strategy. Ensure database backups are appropriately executed and periodic restorations are exercised to ensure backup quality.
- Determine and document DB policies, procedures and standards.
- Performance testing and evaluation to ensure data security, privacy and integrity.
- Identification of bottlenecks and deadlock issues.
- Ensure SLAs and operational KPIs are met, working as necessary with internal and external support functions when major incidents occur.
- Install upgrades and security patches.
- Alter storage structures to meet the evolving needs of the company.
- Set up database user accounts.
- Train users on how to access the information in the database.
- Find and debug malfunctioning programs affecting the database integrity.
- Create autonomous database backups.
- Regularly update the database security protocols.
Desired Skills:
- Ideally Oracle Certified DB Professional (OCP) with more than 5 years’ experience with Oracle, preferably in a lead role.
- Technical degree preferable (B.E. / B. Tech).
- Must have experience with an organisation that requires 24x7 reliability of its database.
- Must have experience with Oracle Exadata.
- Must have operating system experience in Linux.
- Must have hands-on knowledge of shell scripting on the Linux platform.
- Must have hands-on knowledge of Oracle management tools (Data Guard, RMAN).
- Must have hands-on knowledge of SQL / PLSQL.
- Experience should include DB design (modelling and normalisation), capacity planning, performance tuning, storage management, back-up and recovery, managing schemas, report generation and DB clustering technologies.
- Knowledge of Partitioning.
- Knowledge of architecture design principles.
- Good problem solver, focused and dedicated developer with the ability to work on your own as well as with a team.
- Hands-on experience with DB standards and end-user applications, translating capacity requirements into infrastructure deployment technology.
- Strong practical experience of ORACLE in a production environment (Ver 19c).
- Must be a self-starter with a strong attention to detail.
About us:
Company website: https://www.neulife.com/
INNOVERTUS NUTRITION TECHNOLOGIES INDIA PVT.LTD. is the leading Indian company in the field of sports nutrition products and dietary supplements. Incorporated over a decade ago, we are pioneers in introducing the concept of sports nutrition and in educating the masses about the need to be fit and healthy, helping people choose to invest in a healthy lifestyle.
Are you ready for a challenging and exciting endeavor that will require the investment of a lot of hard work, dedication and all your experience?
Are you ready to bring your skills, competencies, and experience to this job? Do you have a solid understanding of software and hardware? You might be exactly the new team member we are looking for.
Your responsibilities
Troubleshoot hardware, software, and other IT solutions:
- Able to work any shift
- Able to work on-call shifts as required
- Able to work independently to support and break-fix hardware, software, and other IT solutions
- Manage AWS infrastructure for production website.
- Set up automated AMI creation and CloudWatch.
- Manage the planning and implementation of information systems security and anti-virus.
- Check and negotiate the IT aspects of contracts with external vendors.
- Perform cost-benefit and return on investment analyses for proposed systems to aid management in making implementation decisions
- Manage and ensure effectiveness of security solutions, including firewalls.
- Deploy CCTV and biometric devices at the warehouse.
- Implement a centralized anti-virus policy for the warehouse and stores to block unnecessary sites as well as external devices.
- Database administration for all production databases: uptime of databases, information access times, and effective data management.
- Database tuning, performance monitoring and troubleshooting.
Has good communication and customer service skills:
- Ability to communicate to all levels of management
- Escalate incident cases to appropriate teams when necessary
- Ability to create new reports or modify existing ones
- Document and discuss complex end-user compute solutions in simple language that is understandable at all levels, both internally and externally.
- Ability to work effectively across departmental boundaries.
Design, develop, and/or update and implement team standards, processes, and documentation:
- Monitor, correct, and report observed infractions of security policies and procedures
- Assist in designing, developing, updating, and implementing team standards, processes, and documentation
- Manage system status, taking steps to improve performance and reliability as directed and per established policies and procedures
- Provide assistance to other technical teams (hands-on support)
- Linux & Windows OS Installation & troubleshooting, Linux patching
- Designing AWS architecture, creating custom VPCs, and launching EC2 instances
- Configuring NACLs, security groups, and route tables as per client requirements
- Peering VPCs across regions; creating and modifying AMIs
- Setting up Elastic Load Balancers and attaching EC2 instances to them
- AWS: VPC, IAM, EC2, Route 53, S3, Elastic Load Balancer, CloudWatch
- User and group administration.
Education and/or Experience:
- Master’s/ Bachelor’s degree (Computer Science, IT)
- 4+ years of work experience in system administration and AWS, with a good understanding of both.
Benefits:
- Flexible work hours, Work from home and holidays policy
- Open and collaborative work culture
- Competitive salary
Please apply with below details in your resume:
- Current CTC
- Expected CTC
- Earliest joining period (In days)
- Linux OS administrator
- Experience with WHM/cPanel
- Experience working on MS Azure.
- Experience with handling domain names and registrars
- Experience with backups
- Knowledge of cloud and dedicated computing
- Experience in setting up and managing the OS, web server (Apache), and app server (Tomcat)
- Knowledge of data security, with experience implementing OS security measures and deploying security patches/upgrades
- Knowledge of networking and cloud technologies
- Deployment of software releases, release management
- Maintain UNIX/Linux Operating System
- Create and maintain environment for running batch jobs associated with daily batch cycle and batch reporting subsystem
- Work with LAN/Network personnel to ensure compatibility with LAN applications and peripheral hardware to provide end users with reliable and stable working environment
- Support and maintain other vendor database software installed on the LAN servers
- Perform User Access Management.
- Experience supporting day-to-day administration functions, including user account management and script creation
- Apply Patches and Upgrades as necessary. Perform tasks for Backup and Recovery Management including High Availability
- Install and configure storage arrays and allocate SAN and NAS storage to different OS platforms and/or administer ZFS storage pools, file system, snapshots, and clones.
- Experience with Linux hypervisors (KVM, Xen).
- Good knowledge of OpenStack administration and service operations.
- Good experience with Veritas NetBackup software.
- Strong experience in public clouds: AWS, Azure, GCP.
- Solid knowledge of protocols such as DNS, HTTP, LDAP, SMTP, and SNMP
- Good understanding of the AWS Outposts hybrid environment.
- Good experience with Windows Server and the OS side.
- Experience in Monitoring platforms like Zabbix, ELK, Grafana.
- Troubleshoot Hardware Issues, Installation, and testing of computer peripherals
- Perform/Implement Security Monitoring and audit to identify any possible security intrusions or breaches.
- Collaborate with other teams and team members to develop automation strategies and deployment processes
Qualifications:
- Degree in Computer Science, Computer Engineering, or a related field
- Minimum of 10 years’ experience
- In-depth knowledge of Linux: RedHat, CentOS, Debian, etc.
- In-depth knowledge in Linux virtualization
- Good knowledge of UNIX and Linux operating systems, file systems, storage environments, and networking protocols
- Knowledgeable in Unix and Linux shell scripting. Practical scripting skills in Shell, Perl, Batch, and Python
- Good DBA skills: MySQL, MariaDB, PostgreSQL, Timescale, MongoDB
- Basic Messaging and Collaboration concepts and tools, Server Virtualization
- Practical understanding of Networking - routing, subnets, UDP, TCP, IP, and VLANs
- Backup and storage management (NetBackup mandatory).
- Able to identify and interpret crash dump files and core dump files, and to monitor system logs
- Computer Science related Bachelor's degree or equivalent experience
- Minimum 2 years of experience with GCP/AWS/Azure cloud systems (5+ years of experience overall)
- Familiarity with LAMP stack technologies, experience supporting
- Must be capable of working on Linux systems.
- Familiarity with SQL, Apache, file storage, load balancers, and Agile methodologies
- Knowledge of shell scripts and Linux system administration.
- Strong DNS management and automation background
- Strong scripting skills (Bash, PHP, Perl)
- Good understanding of TCP/IP networking and troubleshooting
- Clear communication and documentation of projects and procedures
- Strong problem-solving skills
- Demonstrated ability to manage timelines, dependencies, deliverables, milestones, and resource allocation in projects.
- Strong cloud architecture experience.
- Cloud security
- Knowledge of microservices, APIs, and queue systems is an added advantage
- Knowledge of cloud computing technologies and current computing trends.
- Understanding of and willingness to embrace CI/CD and automation tooling such as Jenkins and Git
Responsibilities:
- Set up technical server infrastructure, provide technical assistance to development teams, monitor site performance, and troubleshoot issues when they arise.
- Set up and maintain a development server environment and a live server environment with a process for testing and deploying changes to live sites.
- Take an active role in designing, implementing, and maintaining a scalable and robust enterprise server environment.
- Administer Apache web server, load balancer, and MySQL Server
- Work on system security and iptables configuration
- Optimize servers for high traffic, security, and other system issues.
- Evaluate and propose new or improved system architecture.
- Document system configuration, processes, and procedures.
- Share responsibility with team members for rotating on-call duties.