Director, IT Incident and Problem Management

Smarsh
Belfast

Summary

The Director of IT Incident and Problem Management is a senior leader responsible for shaping and transforming incident and problem management into a predictive and proactive discipline. You will drive a proactive, agile approach to incident response, building and leveraging AI-driven insights to enhance responsiveness and operational efficiency. Your leadership will underpin our pivot from a product to a platform-focused service, ensuring seamless, resilient service delivery that meets our high standards for reliability and customer satisfaction.

As a forward-thinking leader, you will balance traditional ITIL frameworks with modern tools and practices, such as incident.io and FireHydrant, and embed accountability across engineering and operational teams. You will work closely with cross-functional stakeholders including Engineering, Product, and Customer Support to ensure that incidents are resolved promptly and root causes are addressed comprehensively, with the overarching goal of minimizing business impact.


How will you contribute?

  • Strategic Leadership: Provide visionary leadership to evolve our incident and problem management practices, embedding modern approaches that use AI, automation, and predictive capabilities to reduce response times and anticipate potential issues before they impact service.
  • Accountability and Performance: Foster a culture of accountability, holding engineering teams and incident responders to high standards for incident resolution. Ensure robust tracking and reporting of incident response metrics, creating transparency and setting clear performance expectations.
  • Platform-Centric Incident Management: Drive alignment between incident/problem management and the organization's shift towards a unified platform model, ensuring that incident management processes are scalable, adaptable, and aligned with platform objectives.
  • Modern Tool Proficiency: Deploy and optimize advanced incident management platforms such as incident.io and FireHydrant, utilizing these tools to enhance visibility, speed, and effectiveness of response across our platform. Adapt methodologies beyond traditional ITIL to remain agile and customer-focused.
  • Root Cause Analysis and Prevention: Lead comprehensive root cause analysis for major incidents, advocating a preventative stance through continuous improvement and resilience-focused practices. Apply SRE principles and drive actionable outcomes to prevent recurrence.
  • Data-Driven Insights and Reporting: Utilize data-driven insights to inform incident response strategies. Present trends, risk factors, and improvement opportunities to senior executives and stakeholders, supporting business decisions with clear, actionable metrics.

Typical Tasks:

  • Define and implement strategic roadmaps for incident and problem management, ensuring alignment with business objectives and platform goals. Regularly update practices to incorporate the latest in AI, automation, and predictive analytics.
  • Oversee major incident response efforts, ensuring fast, effective containment, resolution, and customer impact mitigation. Lead executive-level post-mortems and ensure comprehensive follow-ups.
  • Conduct and oversee in-depth root cause analyses for recurring or high-impact incidents, developing and deploying preventive measures across the platform to reduce recurrence.
  • Collaborate closely with IT operations, engineering, product, and support teams to ensure a unified approach to incident and problem resolution, with a focus on consistent customer experience.
  • Define, monitor, and optimise KPIs and performance metrics related to incident and problem management. Lead continuous improvement initiatives to ensure process agility and alignment with evolving business requirements.
  • Lead continuous improvement initiatives, including evaluating and refining AI algorithms and predictive models to align with evolving business needs and platform scalability.
  • Drive modular and scalable incident management practices, adaptable to the complexities of a multi-service platform architecture.
  • Develop and deliver reports on incident and problem management metrics for stakeholders, including executive leadership, product management, and customer success teams, to provide insights into trends, risks, and opportunities for improvement.

What will you bring?

  • Strategic Incident and Problem Management Expertise: 10-15 years of experience in IT incident and problem management, ideally within SaaS and platform-based environments, with a minimum of 5 years in a senior leadership capacity.
  • Modern Practices in Incident Management: Demonstrated expertise in using cutting-edge incident management tools (e.g., incident.io, FireHydrant) and AI-driven solutions to streamline processes, drive rapid response, and enhance service reliability.
  • Problem Management: Expertise in leading comprehensive root cause analysis and problem resolution efforts, incorporating Google SRE principles for preventive actions.
  • Google SRE Methodologies: In-depth knowledge of Google SRE philosophies, including error budget management, service level indicators/objectives (SLIs/SLOs), and effective incident response strategies.
  • Platform and SaaS Experience: Strong understanding of platform-oriented operations within B2B SaaS, ideally with experience in supporting a pivot from product to platform. FinTech experience is advantageous but not required.
  • Leadership and Accountability: Proven record of building and leading high-performing teams, with an emphasis on holding teams accountable to clear standards and ensuring consistency in incident response and resolution.
  • Collaborative Communication Skills: Excellent ability to influence and collaborate with cross-functional teams and executive-level stakeholders. Skilled in delivering complex insights to both technical and non-technical audiences.
  • Innovation and Continuous Improvement: Ability to drive continuous improvement through innovative practices, data insights, and strategic thinking. An advocate for evolving incident/problem management to proactively support business goals.
  • Cross-Cloud Environments: Experience managing incident and problem resolution in cross-cloud environments, ideally with a focus on seamless integration of diverse platforms.

Preferred Qualifications:

  • Bachelor’s degree in Computer Science, Information Systems, or a related field; a Master’s degree is preferred.
  • ITIL Expert certification and familiarity with Google SRE principles; advanced certifications in cloud platforms (AWS, GCP, Azure) or incident management tools are highly advantageous.
  • Familiarity with leveraging AI and machine learning within incident and problem management to predict incidents, automate responses, or identify root causes, showcasing an ability to bring innovative solutions to the role.
