At this point, three keys terms should be clarified: 1) Incidents are any
deviation from the standard operations of a system that could, or does,
cause a service interruption; 2) A problem is the condition of having
multiple similar incidents, and 3) a known error is the identified root
cause of a problem.
Essentially, from ITIL we understand that there are two management forces at
work. First, there is incident management, which is concerned with restoring
service as quickly as possible, often using workarounds that address known
errors. Second, problem management is geared toward both proactively and
reactively addressing the underlying causal factors of incidents. Readers
might want to review the ITIL Service Support volume’s chapter on Problem
Management to gain a better understanding.
As complexity increases, the percentage of total system understanding held
by any one IT person will decrease. This is because the level of expertise
to build complex systems demands the involvement of multiple parties. There
just is not an alternative realistic option. Whether developed entirely
in-house, out-sourced or some combination thereof, there are multiple
people, even multiple organizations, involved.
Correspondingly, when incidents or problems occur, root cause analysis
demands review by the parties with the appropriate expertise. For example,
to build a large mission-critical clustered server, there will be
involvement from the vendor(s) of the hardware, the software vendors,
internal software development, IT engineering/release management teams,
security, operations and so on.
Problem Review Boards (PRB)
In the same manner that there are change advisory boards (CABs) for updates
to production systems, there must be a parallel group(s) reviewing incidents
to determine trends, problem identification and ultimately root cause and
mitigation.
Depending on the complexity of the organization, there may be one PRB
overall or a PRB per system. For that matter, some organizations may be so
small or simple that, for whatever reason, they do not need PRBs. In those
cases, it is recommended that they still understand the ITIL Problem
Resolution processes and adopt best practices into their organizations. For
organizations with complex systems, regardless of size, the implementation
of PRBs need to be seriously considered.
The goal of the PRB is to govern problem management reactively and
proactively. This is done through analyzing incidents as they happen,
reviewing historic trend data and staying abreast of current industry news
and vendor updates. For example, a switch may not have failed yet, but your
PRB may know of an operating system bug that had been identified and
eliminated by another organization using the same switch. Hence, it would be
advisable to assess risks and determine the best means to modify the switch
and mitigate incident risks proactively.
Continue on to find out how to structure a PRB….
StructureThe PRB must include representation from relevant stakeholders in order to
effectively review incident trend data for reactive problem management as
well as searching for risks from a proactive stance. This means the PRB
could be comprised of vendor personnel, consultants, IT operations, IT
release management, security, and so on. Incidents that appear to be
establishing a trend would then be assigned to teams that would search for
the underlying problem.
Once the underlying problem had been identified, a request for change would
be issued through the change management process to validate and enact the
recommendations of the PRB as they relate to the production systems and
various stored configurations. It is important to note that the PRB must not
spawn a separate change vector to production; rather, it must serve as an
input to the standard change management process which, just in case, must
have established means of handling emergency change requests.
Summary
Complexity is increasing the need for effective communication and
coordination among the various groups involved with complex systems. As this
happens along with technical specialization, enhanced processes will be
needed to meet service levels including availability and security. To foster
appropriate problem identification and root cause analysis, enterprises
should use problem review boards with the necessary stakeholders represented
in order to make decisions for both proactive and reactive problem
management.
How does your organization handle incident and problem management? If you
have any stories or examples you’d like to share, please email me at
george@spaffordconsulting.com.
Huawei’s AI Update: Things Are Moving Faster Than We Think
FEATURE | By Rob Enderle,
December 04, 2020
Keeping Machine Learning Algorithms Honest in the ‘Ethics-First’ Era
ARTIFICIAL INTELLIGENCE | By Guest Author,
November 18, 2020
Key Trends in Chatbots and RPA
FEATURE | By Guest Author,
November 10, 2020
FEATURE | By Samuel Greengard,
November 05, 2020
ARTIFICIAL INTELLIGENCE | By Guest Author,
November 02, 2020
How Intel’s Work With Autonomous Cars Could Redefine General Purpose AI
ARTIFICIAL INTELLIGENCE | By Rob Enderle,
October 29, 2020
Dell Technologies World: Weaving Together Human And Machine Interaction For AI And Robotics
ARTIFICIAL INTELLIGENCE | By Rob Enderle,
October 23, 2020
The Super Moderator, or How IBM Project Debater Could Save Social Media
FEATURE | By Rob Enderle,
October 16, 2020
FEATURE | By Cynthia Harvey,
October 07, 2020
ARTIFICIAL INTELLIGENCE | By Guest Author,
October 05, 2020
CIOs Discuss the Promise of AI and Data Science
FEATURE | By Guest Author,
September 25, 2020
Microsoft Is Building An AI Product That Could Predict The Future
FEATURE | By Rob Enderle,
September 25, 2020
Top 10 Machine Learning Companies 2020
FEATURE | By Cynthia Harvey,
September 22, 2020
NVIDIA and ARM: Massively Changing The AI Landscape
ARTIFICIAL INTELLIGENCE | By Rob Enderle,
September 18, 2020
Continuous Intelligence: Expert Discussion [Video and Podcast]
ARTIFICIAL INTELLIGENCE | By James Maguire,
September 14, 2020
Artificial Intelligence: Governance and Ethics [Video]
ARTIFICIAL INTELLIGENCE | By James Maguire,
September 13, 2020
IBM Watson At The US Open: Showcasing The Power Of A Mature Enterprise-Class AI
FEATURE | By Rob Enderle,
September 11, 2020
Artificial Intelligence: Perception vs. Reality
FEATURE | By James Maguire,
September 09, 2020
Anticipating The Coming Wave Of AI Enhanced PCs
FEATURE | By Rob Enderle,
September 05, 2020
The Critical Nature Of IBM’s NLP (Natural Language Processing) Effort
ARTIFICIAL INTELLIGENCE | By Rob Enderle,
August 14, 2020
Datamation is the leading industry resource for B2B data professionals and technology buyers. Datamation's focus is on providing insight into the latest trends and innovation in AI, data security, big data, and more, along with in-depth product recommendations and comparisons. More than 1.7M users gain insight and guidance from Datamation every year.
Advertise with TechnologyAdvice on Datamation and our other data and technology-focused platforms.
Advertise with Us
Property of TechnologyAdvice.
© 2025 TechnologyAdvice. All Rights Reserved
Advertiser Disclosure: Some of the products that appear on this
site are from companies from which TechnologyAdvice receives
compensation. This compensation may impact how and where products
appear on this site including, for example, the order in which
they appear. TechnologyAdvice does not include all companies
or all types of products available in the marketplace.