NOC Systems Engineer

KHIPU Networks

KHIPU Networks has an exciting opportunity for a NOC Systems Engineer to join the team.

KHIPU Networks is an award-winning and highly successful cyber security company based in the UK and Africa. We offer outstanding opportunities for candidates of all levels within a dynamic and flexible working environment. Founded in 2005, KHIPU Networks' ethos has always been to work in partnership with customers and to understand their environments and challenges, so that we can design and deploy best-of-breed solutions that enable them to meet their strategic goals.

KHIPU Networks is a privately owned international cyber security company delivering a wide range of network, wireless and security solutions, technologies and services across multiple sectors. We are honoured to be by Royal Appointment to His Majesty the King, and are proud to be an ISO 9001, ISO 27001, ISO 14001 and ISO 45001 certified company. We also support the National Apprenticeship Scheme, employing and developing apprentices within all sectors of the company.

About the role

The NOC Systems Engineer is responsible for the administration, optimisation, and continuous evolution of the monitoring platforms supporting KHIPU's MSP and NOC services.

With a primary focus on PRTG Network Monitor and associated tooling, the role ensures proactive detection, enhanced visibility, and operational intelligence across customer and internal environments.

The successful candidate will drive monitoring maturity, reduce alert noise, and improve service outcomes through automation, standardisation, and the adoption of AI-driven monitoring techniques.

Responsibilities

Monitoring & Platform Administration

  • Manage the monitoring platform (currently PRTG), including sensors, probes, alerts, and reporting
  • Design and implement monitoring standards, templates, and onboarding processes
  • Maintain monitoring baselines, thresholds, and escalation rules aligned to SLAs
  • Ensure high availability, performance, and resilience of the monitoring platform

Service Optimisation & Observability

  • Continuously improve monitoring coverage across network, infrastructure, and services
  • Reduce alert fatigue through intelligent tuning, correlation, and dependency mapping
  • Introduce predictive monitoring and anomaly detection techniques to enhance service quality

AI & Automation

  • Drive adoption of AI-assisted monitoring capabilities for alert prioritisation, correlation, and root cause insight
  • Leverage automation to reduce manual effort and improve response times
  • Explore and integrate AIOps and emerging observability tooling to evolve KHIPU's managed service offering
  • Support development of self-healing and auto-remediation workflows

Event Management & Integration

  • Improve event management processes to ensure actionable and meaningful alerts
  • Integrate monitoring with ITSM (e.g., Halo PSA), notification, and automation platforms
  • Develop correlation and prioritisation models to enhance operational efficiency

Data, Reporting & Insight

  • Develop dashboards and reporting for customers and internal stakeholders
  • Provide insights into performance, availability, trends, and risks
  • Translate technical data into business-relevant reporting and proactive recommendations

Technical Operations Support

  • Support the NOC in incident triage and root cause identification
  • Assist with onboarding customers/services into monitoring
  • Perform upgrades, maintenance, and troubleshooting of monitoring systems

Documentation & Process

  • Maintain runbooks, onboarding guides, and knowledge base content
  • Ensure monitoring adheres to KHIPU standards and best practices
  • Continuously improve operational procedures and governance

Leadership Responsibilities

Team & NOC Enablement

  • Act as SME for monitoring platforms and observability practices
  • Support and mentor NOC engineers
  • Assist in training and development of monitoring capabilities

Operational Excellence

  • Drive consistency in alert handling and incident response
  • Promote clear, actionable, and timely communication standards
  • Support continuous service improvement initiatives

Technology Leadership

  • Stay current with monitoring, automation, and AI/observability trends
  • Recommend improvements to tooling, integrations, and architecture
  • Contribute to the evolution of KHIPU's NOC and managed services capability

Qualifications and Education Requirements

  • Diploma or higher tertiary education

Certifications (advantageous)

  • PRTG Certified Engineer
  • ITIL Foundation
  • Microsoft / Networking certifications

Preferred Skills

  • Strong experience with monitoring platforms (PRTG, SolarWinds, LogicMonitor or similar)
  • Solid understanding of networking and infrastructure monitoring
  • Experience with scripting/automation (PowerShell, Python or similar)
  • Knowledge of SNMP, WMI, NetFlow, Syslog, and APIs
  • Exposure to AI/ML concepts in monitoring (AIOps)
  • Excellent written and spoken English
  • Strong communication and stakeholder engagement skills

How to apply

To apply for this job, you need to log in on our website. If you don't have an account yet, please register.