Amrendra Singh
Platform-Ops Engineer with 2+ years in multi-cloud ops (AWS, Azure, GCP). Skilled in monitoring, incident management, and 24x7 support. Certified in AZ-900, MS-900 & Google Cloud Digital Leader.
Cloud Operations & NOC
---
Summary of Career
1. Platform-Ops Engineer with 2+ years of hands-on experience in Cloud Operations and NOC engineering, ensuring 24x7x365 availability, performance, and scalability of SaaS-based environments across AWS, Azure, and GCP.
2. Spearheaded real-time monitoring and incident response using tools like ServiceNow, Site24x7, Prometheus, and Grafana, achieving 99% SLA compliance on critical incidents through effective triage and root cause analysis.
3. Mitigated $250K+ in annual cloud overspending by automating cost anomaly detection, analyzing usage trends, and collaborating with finance teams to optimize cloud resource utilization.
4. Led backup operations using Metallic and Azure Native Backup, attaining 99.9% success rates and reducing failures by 30% through detailed troubleshooting playbooks and SOP documentation.
5. Served as first-line support for global customers, handling 20+ daily inquiry calls along with Incident and RITM tickets, including user access management, MFA resets, and security group configurations via Azure PIM and CMP.
6. Authored and standardized operational documentation, reducing incident resolution time by 20% and improving team efficiency through structured SOPs and onboarding programs for new hires.
7. Ensured 100% endpoint compliance with ManageEngine Endpoint Central MSP by validating patch cycles, enforcing hardening policies, and maintaining continuous system security.
8. Certified in Azure Fundamentals (AZ-900), Microsoft 365 Fundamentals (MS-900), and Google Cloud Digital Leader, demonstrating cross-platform cloud proficiency.
9. Selected among India’s top candidates for SoftwareONE Academy, received a Pre-Placement Offer (PPO) within 3 months, and ranked in the top 2% of the cohort by excelling in cloud technologies and strategic delivery.
10. Award-winning contributor in global and national competitions, including United Nations representation, Olympiads (top 0.01%), and government-led STEM initiatives, showcasing leadership, coordination, and academic excellence.
Experience as Associate Analyst
Vaco Binary Semantics LLP
20 May 2022
23 June 2023
- Monitored system health, performed deep dives into core systems, and used diagnostic tools to identify recurring patterns and optimize system performance.
- Managed high-priority incidents following established processes, collaborating with cross-functional teams during critical events to ensure timely resolution.
- Maintained detailed technical documentation, updated SOPs with new learnings, and developed troubleshooting guides to ensure operational consistency and team knowledge growth.
- Troubleshot and resolved intricate system-level problems through in-depth root cause analysis while adhering to and improving SOPs; escalated to on-call teams as required.
Experience as Associate Expert Managed Operation: Software & Cloud
SoftwareONE
17 July 2023
To date
- Spearheaded 24x7 monitoring of customer environments for alerts and events using Site24x7 and ServiceNow, resolving 99% of critical incidents within SLA timelines through rapid triage, root cause analysis, and workaround implementation.
- Pioneered Azure/AWS cost anomaly detection processes, mitigating $250K+ in annual overspending by analyzing spend trends, escalating fraud alerts, and partnering with finance teams to optimize resource utilization. Automated monthly cloud cost reports, enhancing stakeholder visibility and enabling data-driven budget decisions.
- Orchestrated backup operations via Metallic and Azure Native Backup, achieving 99.9% job success rates. Authored troubleshooting playbooks that reduced backup failures by 30%. Ensured 100% endpoint compliance with ManageEngine Endpoint Central MSP by validating patching cycles, updating configurations, and submitting change requests for system hardening.
- Served as the first line of support for all incoming Incident and RITM tickets and responded to 20+ customer inquiry calls, working non-standard hours, including on-call rotations on weekends and holidays, to ensure uninterrupted service and incident resolution.
- Spearheaded end-to-end ticket management processes, including initial assessment, triage, and timely routing to appropriate service lines, ensuring alignment with issue severity and complexity. Ensured strict adherence to Operational Level Agreements (OLAs) and Service Level Agreements (SLAs) across service lines, maintaining accountability for resolution timelines and operational efficiency.
- Monitored ticket progression, identifying and escalating high-priority or sensitive issues to relevant stakeholders for expedited resolution.
- Facilitated cross-functional collaboration by engaging stakeholders at all levels to address escalations, mitigate risks, and drive timely solutions.
- Handled first-line troubleshooting, including managing role assignments via PIM (Privileged Identity Management) for access control, adding users to and removing them from security groups to enforce permissions and policies, resetting MFA (Multi-Factor Authentication) to resolve user access issues, and onboarding members to the CMP (Customer Management Portal) to streamline resource access.
- Championed a knowledge-sharing initiative by developing comprehensive documentation, including Standard Operating Procedures (SOPs), to standardize workflows and improve operational clarity, resulting in a 20% reduction in incident resolution time. Independently led onboarding and training programs for 12+ new hires in a fast-paced, global, cross-functional environment, reducing ramp-up time by 25% and accelerating team productivity.
Matric 07 July 2016
Maths, Physics, Chemistry & English.
Intermediate 07 July 2018
Maths, Physics, Chemistry & English.
Bachelor 07 July 2022
Veer Bahadur Singh Purvanchal University, 7.17/10 CGPA. Relevant Coursework: Operating Systems, DBMS, Artificial Intelligence, Computer Networks, Cryptography and Network Security. Awarded the Uttar Pradesh Post-Matric Scholarship for 4 consecutive years (2018–2022) in recognition of consistent academic excellence.