Location: Bangalore / Vadodara / Ahmedabad
Job Type: Full Time / Onsite
Department: IT Infrastructure
Shift: Rotational Shift
Job Summary:
We are seeking an experienced and highly skilled Lead LogicMonitor Administrator to architect, deploy, and manage scalable observability solutions across hybrid IT environments. This role demands deep expertise in LogicMonitor and a strong understanding of modern IT infrastructure and application ecosystems, including on-premises, cloud-native, and hybrid environments.
The ideal candidate will play a critical role in designing real-time service availability dashboards, optimizing performance visibility, and ensuring comprehensive monitoring coverage for business-critical services
Key Responsibilities:
- Monitoring Architecture & Implementation
- Serve as the subject matter expert (SME) for LogicMonitor, overseeing design, implementation, and continuous optimization.
- Lead the development and deployment of monitoring solutions that integrate onpremise infrastructure, public cloud (AWS, Azure, GCP), and hybrid environments.
- Develop and maintain monitoring templates, escalation chains, and alerting policies that align with business service SLAs.
- Real-Time Dashboards & Visualization
- Design and build real-time service availability dashboards to provide actionable insights for operations and leadership teams.
- Leverage LogicMonitor’s APIs and data sources to develop custom visualizations, ensuring a single-pane-of-glass view for multi-layered service components.
- Collaborate with applications and service owners to define KPIs, thresholds, and health metrics.
- Automation & Integration
- Automate onboarding/offboarding of monitored resources using LogicMonitor’s REST API, Groovy scripts, and Configuration Modules.
- Integrate LogicMonitor with ITSM tools (e.g., ServiceNow, Jira), collaboration platforms (e.g., Slack, Teams), and CI/CD pipelines.
- Enable proactive monitoring through synthetic transactions and anomaly detection capabilities
- Operations & Optimization
- Perform ongoing health checks, capacity planning, and tuning of monitoring thresholds to reduce alert fatigue.
- Establish and enforce monitoring standards, best practices, and governance models across the organization.
- Lead incident response investigations, root cause analysis, and post-mortem reviews from a monitoring perspective.
Candidate Requirements:
- Education: Bachelor’s degree in computer science, Information Technology, or a related field.
- 5+ years of hands-on experience with LogicMonitor, including custom DataSources, PropertySources, dashboards, and alert tuning
- Proven expertise in IT infrastructure monitoring: networks, servers, storage, virtualization (VMware), and containerization (Kubernetes, Docker).
- Technical Proficiency:
- Strong understanding of cloud platforms (AWS, Azure) and their native monitoring tools (e.g., CloudWatch, Azure Monitor).
- Experience in scripting and automation (e.g., Python, PowerShell, Groovy, Bash).
- Familiarity with observability stacks: ELK, Grafana is a strong plus.
- Proficient with ITSM and incident management processes, including integrations with ServiceNow.
- Excellent problem-solving, communication, and documentation skills.
- Preferred Qualifications:
- LogicMonitor Certified Professional (LMCP) or similar certification.
- Experience with APM tools (e.g., AppDynamics, Dynatrace, Datadog) and log analytics platforms.
- Knowledge of DevOps practices and CI/CD pipelines.
- Exposure to regulatory/compliance monitoring (e.g., HIPAA, PCI, SOC 2)