Title: Diagnosis and Interpretation
1. Diagnosis and Interpretation
- We concentrate on diagnosis and interpretation because historically they are significant problems that AI has addressed
- There are numerous and varied solutions, providing us with an interesting cross-section of AI techniques to examine
- Diagnosis is the process of determining whether the behavior of a system is correct
- If incorrect, which part(s) of the system is (are) failing?
- We often refer to the result of a diagnosis as one or more malfunctions
- The system being diagnosed can be an artificial (man-made) system or a natural system (e.g., the human body, an ecosystem)
- Man-made systems are easier to diagnose because we understand them thoroughly enough to develop an accurate model
- Interpretation is a related problem: it is the process of explaining the meaning of some object of attention
2. Data-Driven Processes
- While both diagnosis and interpretation seek to explain, the processes are triggered by data
- We use the data (symptoms, manifestations, observations) to trigger possible reasons for why those data have arisen
- Thus, these problems are distinct from goal-driven problems like planning, design, and control
- Control encompasses planning, interpretation, diagnosis, and possibly prediction
- One way to view diagnosis/interpretation: given data, explain why the data have arisen
- Thus, it is an explanation-oriented process
- The result of the process is an explanation which attempts to describe why we have the resulting behavior (malfunctions or observations)
- We will reconsider this idea (explanation as a process) later
3. The Diagnostic Task
- Data trigger causes (hypotheses of malfunctions, or potential diagnoses), typically through an associational form of knowledge
- Hypotheses must be confirmed through additional testing and inspection of the situation
- Hypotheses should be as specific as possible, so they need to be refined (e.g., given a general class of disease, find the most specific subclass)
4. Forms of Interpretation
- The idea behind interpretation is that we are trying to understand why something has happened
- Diagnosis is a form of interpretation in that we are trying to understand a system's deviation from the norm
- What caused the system to deviate? What components have broken down? Why?
- Diagnosis is one form of interpretation, but there are others:
- Data analysis: what phenomenon caused the data to arise, e.g., studying astronomical phenomena by looking at radio signals, or looking at blood clots and deciding on blood types
- Object identification: given a description of an object (in some form, whether visual or data), what is the object?
- Speech recognition: interpret the acoustic signal in terms of words/meanings
- Communication: what is the meaning behind a given message? This can be carried over to the analysis of artwork
- Evidence analysis: trying to decipher the data from a crime scene to determine what happened, who committed the crime, and why
- Social behavior: explaining why someone acted in a particular way
5. Some Definitions
- Let us assume that our knowledge of a given system is contained in a model
- A diagnosis is a particular hypothesis of how the system differs from the model
- Which component(s) is (are) not functioning as modeled?
- A diagnosis is a description of one possible state of the system where that state is not the normal state
- A consistency-based diagnosis is a diagnosis where each component of the system is labeled as either normal or abnormal (functioning correctly or not) such that the description is consistent with the observations
- If there are n components in a system, there are 2^n different diagnoses because we must consider that multiple components may fail
- A minimal diagnosis is a diagnosis consisting of some set of components C such that no proper subset of C is itself a consistent diagnosis
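These definitions can be made concrete in a short sketch. The toy three-component system, its `DOWNSTREAM` fault table, and the consistency test are all invented for illustration; a real consistency-based diagnoser would derive consistency from a behavioral model of the components rather than a lookup table:

```python
from itertools import combinations

# Toy fault model: each component's failure can account for certain bad outputs
DOWNSTREAM = {"c1": {"o1"}, "c2": {"o1", "o2"}, "c3": {"o2"}}

def consistent(abnormal, observations):
    """Hypothetical consistency check: the diagnosis is consistent if every
    observed faulty output is covered by some abnormal component."""
    if not abnormal:
        return not observations
    return observations <= set().union(*(DOWNSTREAM[c] for c in abnormal))

def minimal_diagnoses(components, observations):
    """Enumerate the 2^n candidate diagnoses smallest-first; keep each
    consistent set that has no consistent proper subset already kept."""
    minimal = []
    for k in range(len(components) + 1):
        for cand in combinations(sorted(components), k):
            s = set(cand)
            if consistent(s, observations) and not any(m < s for m in minimal):
                minimal.append(s)
    return minimal

print(minimal_diagnoses({"c1", "c2", "c3"}, {"o1", "o2"}))
# Either c2 alone, or c1 and c3 together, explains both bad outputs
```

Enumerating smallest subsets first means the proper-subset test suffices for minimality; for large n a real system would prune rather than enumerate all 2^n candidates.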
6. First Interpretation System
- The system Dendral, from 1966, was given mass spectrogram data and inferred the chemical composition from that data
- The input would be the mass of the substance along with other experimental lab data
- Dendral would apply knowledge of atomic masses, valence rules, and connectivity among atoms to determine combinations and connections of the atoms in the unknown compound
- The number of combinations grows exponentially with the size (mass) of the unknown compound
- Dendral used a plan-generate-test process
- First, constraints would be generated based on heuristic knowledge of what molecules might appear, given the initial input and any knowledge presented about the unknown compound
7. Dendral Continued
- The planning step would constrain the generate step
- At this step, graphical representations of possible molecules would be generated
- The constraints are necessary to reduce the number of possible graphs generated
- The final step, testing, attempts to eliminate all but the correct representations
- Each remaining graph is scored by examining the candidate molecular structure and comparing it against mass spectrometry rules and reaction chemistry rules
- Structures are discarded if they are inconsistent with the spectrum or known reactions
- Any remaining structures are presented to the operator
- At this point, the operator can input additional heuristic rules that can be applied to this case to prune away incorrect structures
- These rules are added to the heuristics, so Dendral learns
- A thorough examination is presented at http://profiles.nlm.nih.gov/BB/A/B/O/M/_/bbabom.pdf
8. Mycin
- Mycin was the next important step in the evolution of AI expert systems and AI in medicine
- The first well-known and well-received expert system, it also presented a generic solution to reasoning through rules
- It provided uncertainty handling in the form of certainty factors
- After creating Mycin, some of the researchers developed the rule-based language E-Mycin (Essential or Empty Mycin) so that others could develop their own rule-based expert systems
- Mycin had the ability to explain its conclusions by showing the matching rules that it used in its chain of logic
- Mycin outperformed the infectious disease experts when tested, arriving at an acceptable therapy in 69% of its cases
- A spinoff of Mycin was a teaching tool called GUIDON, which is based on the Mycin knowledge base
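Mycin's certainty-factor arithmetic for combining evidence from multiple rules can be sketched as follows. The combining function is MYCIN's published rule; the example values (.3 and .4) are made up:

```python
def combine_cf(cf1, cf2):
    """MYCIN's rule for combining two certainty factors (each in [-1, 1])
    bearing on the same hypothesis."""
    if cf1 >= 0 and cf2 >= 0:          # both supporting
        return cf1 + cf2 * (1 - cf1)
    if cf1 <= 0 and cf2 <= 0:          # both opposing
        return cf1 + cf2 * (1 + cf1)
    # mixed evidence
    return (cf1 + cf2) / (1 - min(abs(cf1), abs(cf2)))

# Two rules give weakly suggestive evidence (.3 and .4) for the same category
print(combine_cf(0.3, 0.4))   # 0.3 + 0.4 * (1 - 0.3) = 0.58
```

Note that the combined belief (0.58) exceeds either individual CF but can never reach 1.0 unless one rule is itself definite, which is the intended behavior of accumulating suggestive evidence.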
9. The Importance of Explanation
- The Dendral system presented an answer but did not explain how it arrived at its conclusions
- Mycin could easily generate an explanation by outputting the rules that matched in the final chain of logic
- E.g., rule 12 → rule 15 → rule 119 → rule 351
- A user can ask questions like "why was rule 351 selected?", to which Mycin responds by showing the rule's conditions (LHS) and why those conditions were true
- The reason why a rule is true is usually based on previous rules being true, leading to conclusions that made the given rule true
- By being able to see the explanation, one can feel more confident in the system's answers
- It is also a great tool to help debug and develop the knowledge base
10Mycin Sample Rules
RULE116 IF 1) the identity of ORGANISM-1 is not
known 2) the gram stain of ORGANISM-1 is not
known 3) the morphology of ORGANISM-1 is
not known 4) the site of CULTURE-1 is csf
5) the infection is meningitis 6) the age
(in years) of the patient is less than equal to
.17 THEN There is weakly suggestive evidence (.
3) that the category of ORGANISM-1 is
enterobacteriaceae RULE050 IF 1) the morphology
of ORGANISM-1 is rod 2) the gram stain of
ORGANISM-1 is gramneg 3) the aerobicity of
ORGANISM-1 is facultative 4) the infection
with ORGANISM-1 was acquired while the patient
was hospitalized THEN There is evidence that
the category of ORGANISM-1 is enterobacteriaceae
11. Systems Generated From Emycin
- SACON: Structural Analysis CONsultant
- Puff: pulmonary disorders
- originally implemented in Emycin before being re-implemented as an OO system

A SACON rule:
IF   1) the material composing the sub-structure is one of the metals, and
     2) the analysis error that is tolerable is between 5 and 30, and
     3) the non-dimensional stress of the sub-structure > .9, and
     4) the number of cycles the loading is to be applied is between 1000 and 10000
THEN it is definite (1.0) that fatigue is one of the stress behavior phenomena in the sub-structure

A Puff rule:
IF   1) the mmf/mmf-predicted ratio is 35..45 and the fvc/fvc-predicted ratio > 88, or
     2) the mmf/mmf-predicted ratio is 25..35 and the fvc/fvc-predicted ratio < 88
THEN there is suggestive evidence (.5) that the degree of obstructive airways disease as indicated by the MMF is moderate, and it is definite (1.0) that the following is one of the findings about the diagnosis of obstructive airways disease: reduced mid-expiratory flow indicates moderate airway obstruction.
12. A Fuzzy Logic Approach
- The process is one of:
- Fuzzifying the inputs
- a blood pressure of 145 mmHg can be denoted as low/0, medium/.4, high/.6
- Fuzzy reasoning
- applying rules similar to Mycin's
- recall that fuzzy systems do poorly with lengthy chains of rules, so we will primarily use fuzzy logic in diagnosis when there are few rules and limited chains of logic
- we use fuzzy logic and set theory to compute AND, OR, NOT, implication, difference, etc. as needed for the rules
- Fuzzy classes
- given the result of our rules, we defuzzify by identifying which class (malfunction(s)/diagnosis(es)) is rated the highest
- FL has been used for automotive diagnosis, clinical lab test interpretation, mammography interpretation, and more
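The fuzzify/reason/defuzzify cycle might be sketched like this. The membership breakpoints are chosen only so that 145 mmHg yields the low/0, medium/.4, high/.6 values from the slide; the rule and the "age is old" membership are invented:

```python
def fuzzify_bp(mmhg):
    """Toy piecewise-linear membership functions for blood pressure
    (made-up breakpoints, for illustration only)."""
    low = max(0.0, min(1.0, (120 - mmhg) / 30))
    high = max(0.0, min(1.0, (mmhg - 130) / 25))
    medium = max(0.0, 1.0 - low - high)
    return {"low": low, "medium": medium, "high": high}

bp = fuzzify_bp(145)
print(bp)                              # {'low': 0.0, 'medium': 0.4, 'high': 0.6}

# Fuzzy rule: IF bp is high AND age is old THEN risk is elevated.
# In classic fuzzy set theory AND is min and OR is max.
age_old = 0.7                          # invented membership value
risk_elevated = min(bp["high"], age_old)

# Defuzzify by picking the highest-rated class
diagnosis = max(bp, key=bp.get)
print(diagnosis, risk_elevated)
```

With only one short rule there is no long inference chain, which matches the caution above about fuzzy systems and lengthy rule chains.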
13. Analyzing Mycin's Process
- A thorough analysis of Mycin discovered that its rule-based approach was actually performing three specific tasks
- Data are first translated using data abstraction from specific values to values that may be of more use (e.g., changing a real value into a qualitative value)
- The disease(s) is then classified
- The hypothesis is refined into more detail
- Considering the diagnostic process as three related but different tasks allows one to more clearly understand the process
- With that knowledge, it becomes easier to see how to solve a diagnostic task: use classification
14. Classification as a Task
- One can organize the space of diagnostic conclusions (malfunctions) into a taxonomy
- The diagnostic task is then one of searching the taxonomy
- This has been coined hierarchical classification
- The task can be solved by establish-refine:
- Attempt to establish a node in the hierarchy
- If found relevant, refine it by recursively trying to establish any of the node's children
- If found non-relevant, prune that portion of the hierarchy away and thus reduce the complexity of the search
- How does one establish a node as relevant?
- Here, we can employ any number of possible approaches, including rules
- Think of the node as a specialist in identifying that particular hypothesis
- Encode any relevant knowledge to recognize (establish) that hypothesis in the node itself
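Establish-refine can be sketched in a few lines. The car-trouble taxonomy and the per-node establish predicates below are invented stand-ins for real domain knowledge (each node's "specialist" is just a predicate over the observed findings):

```python
# Malfunction taxonomy: parent -> children
TAXONOMY = {
    "engine-problem": ["fuel-problem", "electrical-problem"],
    "fuel-problem": ["empty-tank", "clogged-filter"],
    "electrical-problem": ["dead-battery"],
}

# Each node's establish knowledge, here a simple predicate over findings
ESTABLISH = {
    "engine-problem":     lambda f: "wont-start" in f,
    "fuel-problem":       lambda f: "fuel-gauge-low" in f,
    "electrical-problem": lambda f: "dim-lights" in f,
    "empty-tank":         lambda f: "fuel-gauge-empty" in f,
    "clogged-filter":     lambda f: "recent-bad-fuel" in f,
    "dead-battery":       lambda f: "no-crank" in f,
}

def establish_refine(node, findings):
    """If a node is established, recursively refine into its children;
    otherwise prune its whole subtree. Returns the most specific
    established hypotheses."""
    if not ESTABLISH[node](findings):
        return []                        # prune: skip all descendants
    children = TAXONOMY.get(node, [])
    refined = [h for c in children for h in establish_refine(c, findings)]
    return refined or [node]             # no child established: stop here

print(establish_refine("engine-problem",
                       {"wont-start", "dim-lights", "no-crank"}))
```

Pruning a rejected node skips its entire subtree, which is where the reduction in search complexity comes from.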
15. Supporting Classification
- The establish knowledge can take on any number of different forms:
- Rules (possibly using fuzzy logic, certainty factors, or other methods)
- Feature-based pattern matching
- Bayesian probabilities or HMMs
- Neural network activation strength
- Genetic algorithm fitness function
- In nearly every case, what we are seeking is a set of pre-determined features
- Which features are present? Which are absent?
- How strongly do we believe in a given feature?
- If the feature is not found in the database, how do we acquire it?
- By asking the user? By asking for a test result? By performing additional inference?
- Notice that in the neural network case, features are inputs, whereas in most of the other cases they are conditions usually found on the LHS of rules
16. Feature-based Pattern Matching
- A simple way to encode associational knowledge to support a hypothesis is to enumerate the features (observations, symptoms) we expect to find if the hypothesis is true
- We can then enumerate patterns that provide a confidence value we might have if we saw the given collection of features
- Consider that for hypothesis H we expect features F1 and F2, possibly F3 and F4, but not F5, where F1 is essential and F2 is somewhat less essential

  F1    F2    F3    F4    F5    Result
  yes   yes   yes   yes   no    confirmed
  yes   yes   ?     ?     no    likely
  yes   ?     ?     ?     no    somewhat likely
  ?     yes   ?     ?     no    neutral/unsure
  ?     ?     ?     ?     yes   ruled out

- "?" means "don't care"
- We return the result from the first pattern to match, so this is in essence a nested if-else statement
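As the last bullet says, the table collapses into a chain of if-else tests that returns the first matching pattern. A direct transcription (hypothesis H and features F1..F5 are the hypothetical ones from the table):

```python
def match_hypothesis(f1, f2, f3, f4, f5):
    """Return the confidence in hypothesis H given booleans for features
    F1..F5. Each 'if' is one row of the pattern table, tried in order;
    a '?' (don't care) cell is encoded by simply not testing that feature."""
    if f1 and f2 and f3 and f4 and not f5:
        return "confirmed"
    if f1 and f2 and not f5:
        return "likely"
    if f1 and not f5:
        return "somewhat likely"
    if f2 and not f5:
        return "neutral/unsure"
    if f5:
        return "ruled out"
    return "no match"

print(match_hypothesis(True, True, False, False, False))   # likely
print(match_hypothesis(False, False, True, True, True))    # ruled out
```

Because the rows are tried top-down, a more specific pattern (confirmed) shadows the more general ones beneath it, so row order in the table matters.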
17. Data Abstraction
- In Mycin, many rules were provided to perform data abstraction
- In a pattern matching approach, we might have a feature of interest that is not directly evident from the data, but the data might be abstracted to provide us with the answer
- Example: was the patient anesthetized in the last 6 months?
- No datum indicates this directly, but we see that the patient had surgery 2 months ago, so we can infer that the patient was anesthetized
- Data abstractions might be domain specific
- in which case we have to codify each inference, as shown above
- Or they may be domain independent
- such as temporal reasoning or spatial reasoning
- Another form is to discard a specific value in favor of a more qualitative value (e.g., a temperature of 102 becomes "high fever")
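Both kinds of abstraction on this slide can be sketched directly. The fever thresholds and the `(procedure, months_ago)` record format are assumptions made for illustration:

```python
def abstract_temperature(temp_f):
    """Quantitative-to-qualitative abstraction of a numeric reading
    (threshold values are illustrative, not clinical guidance)."""
    if temp_f >= 103:
        return "very high fever"
    if temp_f >= 101:
        return "high fever"
    if temp_f >= 99.5:
        return "mild fever"
    return "normal"

def was_anesthetized(events, within_months=6):
    """Domain-specific inference: surgery implies anesthesia.
    `events` is a list of (procedure, months_ago) pairs (hypothetical
    patient-record format)."""
    return any(proc == "surgery" and ago <= within_months
               for proc, ago in events)

print(abstract_temperature(102))            # high fever
print(was_anesthetized([("surgery", 2)]))   # True
```

The first function is the "discard a specific value" form; the second codifies one domain-specific inference, exactly the kind of rule Mycin needed many of.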
18. Example 1: Automotive Diagnosis
19. Example 2: Syntactic Debugging
20. Example 3: Linux User Classification
21. Lack of Differentiation
- Notice that through the use of simple (hierarchical) classification, one does not differentiate among possible hypotheses
- If two hypotheses are found to be relevant, we do not have additional knowledge to select one
- What if X and Y are both established with X being more certain than Y: which should we select?
- What if X and Y have some form of association with each other, such as being mutually incompatible, or jointly likely?
- We would like to employ a process that contains such knowledge so as to let us select only the most likely hypothesis(es) given the data
- In a neural network, we would only select the most likely node, and similarly for an HMM, the most likely path
22. Abduction
- This leads us to abduction, a form of inference first termed by the philosopher Charles Peirce
- Peirce saw abduction as follows
- Deduction says that
- if we have the rule A → B
- and given that A is true
- then we can conclude B
- But abduction says that
- if we have the rule A → B
- and given that B is true
- then we can conclude A
- Notice that deduction is truth preserving but abduction is not
- We can expand the idea of abduction as follows
- if A1 ∨ A2 ∨ A3 ∨ … ∨ An → B
- and given that B is true
- and Ai is more likely than any other Aj (1 ≤ j ≤ n, j ≠ i), then we can infer that Ai is true
- for this to work, we need a way to determine which is most likely
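A minimal sketch of this expanded abduction, assuming an invented rule set and invented prior likelihoods as the "most likely" scorer:

```python
def abduce(rules, observation, likelihood):
    """Given rules (Ai, B) read as Ai -> B and an observed B, abduce the
    most likely antecedent Ai; `likelihood` is any scoring function over
    hypotheses (here, a table of priors)."""
    candidates = [a for a, b in rules if b == observation]
    if not candidates:
        return None
    return max(candidates, key=likelihood)

# Hypothetical rule base and priors
RULES = [("rain", "wet-grass"),
         ("sprinkler", "wet-grass"),
         ("street-cleaning", "wet-street")]
PRIOR = {"rain": 0.6, "sprinkler": 0.3, "street-cleaning": 0.1}

print(abduce(RULES, "wet-grass", PRIOR.get))   # rain
```

This is exactly why abduction is not truth preserving: "sprinkler" also entails "wet-grass", and picking the highest-scoring antecedent can still be wrong.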
23. Inference to the Best Explanation
- Another way to view abduction is as follows:
- D is a collection of data (facts, observations, symptoms) to explain
- H explains D (if H is true, then H can explain why D has appeared)
- No other hypothesis explains D as well as H does
- Therefore H is probably correct
- Although the problem can be viewed as similar to classification, we need to locate an H that accounts for D
- We now need additional knowledge: explanatory knowledge
- What data can H explain?
- How well can H explain the data?
- Is there some way to evaluate H given D?
- Additionally, we will want to know if
- H is consistent
- we have considered all Hs in our domain
- What complicates generating a best explanation is that H and D are probably not singletons but sets
24. Continued
- Assume H is a collection of hypotheses that can all contribute to an explanation, H = {H1, H2, H3, …, Hn}
- D is a collection of data to be explained, D = {d1, d2, d3, …, dm}
- a given hypothesis can account for one or more data (e.g., H3 can explain d1 and d5)
- assume that we have ranked all elements of H with some scoring algorithm (Bayesian probability, neural network strength of activation, feature-based pattern matching, etc.)
- The abductive process is to generate the best subset of H that can explain D
- what does "best" mean?
25. Ways to View "Best"
- We will call a set of hypotheses that can explain the data a composite hypothesis
- The best composite hypothesis should have these features:
- Complete: explains all data (or as much as is possible)
- Consistent: there are no incompatibilities among the hypotheses
- Parsimonious: the composite has no superfluous parts
- Simplest: all things considered, the composite should have as few individual hypotheses as possible
- Most likely: this might be the most likely composite, or the composite with the most likely hypotheses (how do we compute this?)
- In addition, we might want to include additional factors:
- Cheapest: costing (if applicable), the composite that would be the least expensive to believe
- Generated with a reasonable amount of effort: generating the composite in a non-intractable way (abduction is generally an NP-complete problem)
26. Internist: Rule-based Abduction
- One of the earliest expert systems to apply abduction was Internist, used to diagnose internal diseases
- Internist was largely a rule-based system
- The abduction process worked as follows:
- Data trigger rules of possible diseases
- For each disease triggered, determine what other symptoms are expected by that disease, which are present, and which are absent
- Generate a score for that disease hypothesis
- Now compare disease hypotheses to differentiate them
- If one hypothesis is more likely, try to confirm it
- If many possible hypotheses, try to rule some out
- If a few hypotheses remain, try to differentiate between them by seeking data (e.g., test results) that one expects and the others do not
- The diagnostic conclusions are those hypotheses that still remain at the end and that each explain some of the data
27. Neural Network Approach
- Paul Thagard developed ECHO, a system to compute explanatory coherence
- ECHO was developed as a neural network where nodes represent hypotheses and data
- links represent potential explanations between hypotheses and data
- and hypothesis relationships (mutual incompatibilities, mutual support, analogy)
- Unlike a normal neural network, nodes here represent specific concepts
- weights are learned from the strength of the relationships found in test data
- In fact, the approach is far more like a Bayesian network, with edge weights representing conditional probabilities (counts of how often a hypothesis supports a datum)
- When data are introduced, a propagation algorithm runs over the present data until the hypothesis nodes and data nodes reach a stable state (similar to a Hopfield net); the best explanation then consists of those hypothesis nodes whose activations are above a preset threshold
28. Example: Evolution (DH) vs. Creationism (CH)
29. (No transcript)
30. Probabilistic Approach(es)
- Pearl's belief networks and the generic idea behind the HMM can be considered abductive problem-solving techniques
- Notice that there is no explicit coverage of hypotheses to data; for instance, we do not select a datum and ask "what will explain this?"
- Instead, the solution is derived to be the best explanation, but where the explanation is generated by finding the most probable cause of the collection of data in a holistic approach
- The typical Bayesian approach contains probabilities of a hypothesis (state) being true, of a hypothesis transitioning to another hypothesis, and of an output being seen from a given hypothesis
- But there is no apparent mechanism to encode hypothesis incompatibilities or analogies
31. Example
- In the diagram of a system:
- I represents inputs
- O represents outputs
- Ab represents component parts that might be malfunctioning
- In the formula, dc is a diagnostic conclusion (malfunction) based on input and output i, o
32. The Peirce Algorithm
- The previous strategies assume that knowledge is available in either a rule-based or probabilistic format
- The Peirce algorithm instead uses generic tasks
- The algorithm has evolved over the course of constructing several knowledge-based systems
- The basic idea is:
- Generate hypotheses
- this might be through hierarchical classification, neural network activity, or other means
- Instantiate generated hypotheses
- for each hypothesis, determine its explanatory power (what it can explain from the data), its hypothesis interactions (for the other generated hypotheses, are they compatible, incompatible, etc.), and some form of ranking
- Assemble the best explanation
- see the next slide
33. The Assembly Algorithm
- Examine all data and see if there are any data that can only be explained by a single hypothesis
- such a hypothesis is called an essential hypothesis
- Include all essential hypotheses in the composite
- Propagate the effects of including these hypotheses (see next slide)
- Remove from the data all data that can now be explained
- Start from the top (this may have created new essentials)
- Examine remaining data and see if there are any data best explained by a superior hypothesis
- such a hypothesis clearly beats all competitors by having a much higher ranking
- Include all superior hypotheses in the composite, propagate, and remove
- Start from the top (this may have created new essentials)
- Examine remaining data and see if there are any data best explained by a better hypothesis
- such a hypothesis is merely better than all of its competitors
- Include all better hypotheses in the composite, propagate, and remove
- Start from the top (this may have created new essentials)
- If there are still data to explain, either guess or quit with unexplained data
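The essentials step, with its restart-from-the-top loop, can be sketched as follows. The superior/better passes and incompatibility propagation are omitted for brevity, and the `EXPLAINS` map is invented:

```python
def assemble(explains, data):
    """One simplified pass of abductive assembly: repeatedly include any
    hypothesis that is the ONLY explainer of some remaining datum (an
    essential), remove everything it explains, and restart, since removing
    data may create new essentials. Returns (composite, unexplained)."""
    composite, remaining = set(), set(data)
    changed = True
    while changed and remaining:
        changed = False
        for d in sorted(remaining):
            explainers = [h for h, ds in explains.items() if d in ds]
            if len(explainers) == 1:         # d makes its explainer essential
                h = explainers[0]
                composite.add(h)
                remaining -= explains[h]     # h may explain other data too
                changed = True
                break                        # start from the top
    return composite, remaining

# Hypothetical explanatory coverage
EXPLAINS = {"H1": {"d1", "d2"}, "H2": {"d2", "d3"}, "H3": {"d3"}}
print(assemble(EXPLAINS, {"d1", "d2", "d3"}))
```

Here H1 is essential (only it explains d1) and sweeps up d2 with it, but d3 is left with two competing explainers, which is where the superior/better passes, or a guess, would take over.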
34. Propagation
- The idea behind the Peirce algorithm is to build on islands of certainty
- If a hypothesis is essential, it is the only way to explain some datum, so it MUST be part of the best explanation
- If a hypothesis is included in the composite, we can leverage knowledge of how that hypothesis relates to others
- If the hypothesis, say H1, is incompatible with H2, then since we believe H1 is true, H2 must be false: discard it
- If hypothesis H1 is very unlikely to appear with H2, we can downgrade H2's ranking
- If hypothesis H1 is likely to appear with H2, we can either reconsider H2 or just bump up its ranking
- If hypothesis H1 can be inferred to imply H2 by analogy, we can include H2
- Since H1 was included because it was the only (or best) way to explain some data, we build upon that island of certainty, perhaps creating new essentials because H1 is incompatible with other hypotheses
35. Layered Abduction
- For some problems, a single data-to-hypothesis mapping is insufficient
- either because we have more knowledge to bring to bear on the problem or because we want an explanation at a higher level of reasoning
- For instance, in speech recognition, we wouldn't want to just generate an explanation of the acoustic signal as a sequence of phonetic units
- So we map the output of one level into another
- The explanation of one layer becomes the input of the next layer: we explain the phonetic unit output as a sequence of syllables, the syllables as a sequence of words, and then the sequence of words as a meaningful statement
- We can use partially formed hypotheses at a higher level to generate expectations for a lower layer, thus giving us some top-down guidance
36. Example: Handwritten Character Recognition (CHREC)
37. Overall Architecture
- The system has a search space of hypotheses
- the characters that can be recognized
- this may be organized hierarchically, but here it's just a flat space: a list of the characters
- each character has at least one recognizer
- some have multiple recognizers if there are multiple ways to write the character, like "0", which may or may not have a diagonal line from right to left
- After character hypotheses are generated for each character in the input, the abductive assembler selects the best ones to account for the input
38. Explaining a Character
- The features (data) found to be explained for this character are three horizontal lines and two curves
- While both the "E" and "F" characters were highly rated, "E" can explain all of the features while "F" cannot, so "E" is the better explanation
39. Top-down Guidance
- One benefit of this approach is that, by using domain-dependent knowledge,
- the abductive assembler can increase or decrease individual character hypothesis beliefs based on partially formed explanations
- for instance, in the postal mail domain, if the assembler detects that it is working on the zip code (because it already found the city and state on one line), then it can rule out any letters that it thinks it found
- since we know we are looking at "Saint James, NY", the following five characters must be numbers, so "I" (for one of the 1s), "B" (for the 8), and "O" (for the 0) can all be ruled out (or at least scored less highly)
40. Full Example in a Natural Language Domain
41. Model-based Diagnosis: Functional
- In all of our previous examples of diagnosis and interpretation, our knowledge was associational
- We associate these symptoms/data with these diseases/malfunctions
- This is fine when we do not have a complete understanding of the system
- Medical diagnosis
- Speech recognition
- Vision understanding
- What if we do understand the system?
- E.g., a human-made artifact
- If this is the case, we should be able to provide knowledge in the form of the function that a given component provides in the system and how that function is achieved through its behavior (process)
- Debugging can be performed by simulating performance with various components not working
42. The Clapper Buzzer
- This mechanical device works as follows:
- When you press the button (not shown), it completes the circuit, causing current to flow to the coil
- When the magnetic coil charges, it pulls the clapper hand toward it
- When the clapper hand moves, it disconnects the circuit, causing the coil to stop pulling the hand; the hand then falls back, hitting a bell (not shown) and causing the ringing sound
- This also reconnects the circuit, and so the process repeats until the button is no longer pressed
43. Generating a Diagnosis
- Given a functional representation, we can reason over whether a function can be achieved or not
- Hypothetical or "what would happen if" reasoning
- What would happen if the coil were not working?
- What would happen if the battery were not charged?
- What would happen if the clapper arm were blocked?
- We can also use the behavior and test results to find out what function(s) is not being achieved
- With the switch pressed, we measure current at the coil, so the coil is being charged
- We measure a magnetic attraction to show that the coil is working
- We do not hear a clapping sound, so the magnetic attraction is either not working, or the acoustic law is not being fulfilled
- Why not? Perhaps the arm is not magnetic? Perhaps there is something on the arm so that when it hits the bell, no sound is emitted
44. Model-based Diagnosis: Probabilistic
- While a functional representation can be useful for diagnosis, it is somewhat problem independent
- FRs can be used for prediction ("what would happen if" reasoning), diagnosis, planning, redesign, etc.
- Diagnosis typically is more focused, so we can create a model of system components and their performance and enhance the model with probabilities
- Failure rates can be used for prior probabilities
- Evidential probabilities can be used to denote the likelihood of seeing a particular output from a component given that it has failed
- Bayesian probabilities can then be easily computed
45. Example
- The device consists of 3 multipliers (M1, M2, M3) and 2 adders (A1, A2)
- F computes AC + BD
- G computes BD + CE
- Given the inputs, F should output 12 but computes 10
- Given the inputs, G should output 12 and does
- We use the model to compute the diagnosis
- Possible malfunctions are with M1, M2, or A1, but not M3 or A2
- If we can probe the inside of the machine
- we can obtain values for X, Y, and Z to remove some of the contending malfunction hypotheses
- We can employ probabilities of component failure rates and the likelihood of seeing particular values given the input to compute the most likely cause
- note it could be a multiple component failure
- If we have a model of the multiplier and adder, we can also use that knowledge to assist in diagnosis
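This example can be simulated directly. The input values (A=3, B=2, C=2, D=3, E=3) are an assumption, since the slide's diagram is not shown; they are chosen so that both outputs should be 12. Treating a broken component's output as simply unknown reproduces the candidate set above:

```python
def predict(broken, A=3, B=2, C=2, D=3, E=3):
    """Simulate the device: M1 computes X=A*C, M2 computes Y=B*D,
    M3 computes Z=C*E, A1 computes F=X+Y, A2 computes G=Y+Z.
    A broken component's output is unknown (None), and unknowns
    propagate downstream."""
    X = None if "M1" in broken else A * C
    Y = None if "M2" in broken else B * D
    Z = None if "M3" in broken else C * E
    F = None if "A1" in broken or None in (X, Y) else X + Y
    G = None if "A2" in broken or None in (Y, Z) else Y + Z
    return F, G

def consistent_single_faults(obs_F, obs_G):
    """A single-fault candidate is consistent if no predicted value
    contradicts an observation (an unknown contradicts nothing)."""
    return [comp for comp in ["M1", "M2", "M3", "A1", "A2"]
            if predict({comp})[0] in (None, obs_F)
            and predict({comp})[1] in (None, obs_G)]

print(predict(set()))                    # (12, 12): the fault-free prediction
print(consistent_single_faults(10, 12))  # ['M1', 'M2', 'A1']
```

M3 and A2 are ruled out because breaking either still forces F = 12, contradicting the observed 10; probing X, Y, and Z would discriminate among the three survivors.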
46. Neural Network Approach
- Recall that neural networks, while trainable to perform recognition tasks, are knowledge-poor
- Therefore, they seem unsuitable for diagnosis
- However, many diagnostic tasks or subtasks revolve around
- data interpretation
- visual understanding
- And neural networks might contribute to diagnosis by solving these lower-level tasks
- NNs have been applied to assist in
- Congestive heart failure prediction based on patient background and habits
- Medical imaging interpretation for lung cancer and breast cancer (MRI, chest X-ray, CAT scan, radioactive isotope, etc.)
- Interpreting forms of acidosis based on blood work analysis
47. Case-Based Diagnosis
- Case-based reasoning is most applicable when
- there are a sufficiently large number of cases
- there is knowledge of how to manipulate a previous case to fit the current situation
- This is most commonly done with planning/design, not diagnosis
- So for diagnosis, we need a different approach:
- Retrieve all cases that are deemed relevant to the current input
- Recommend those cases that match closely by combining common diagnoses via a weighted voting scheme
- Supply a confidence based on the strength of the votes
- If deemed useful, retain the case to provide the system with a mechanism for learning from new situations
- This approach has been employed by GE for diagnosing gas engine turbine problems
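The retrieve/recommend/confidence steps might be sketched as follows. The similarity metric (feature overlap), the example cases, and the turbine-flavored feature names are all invented for illustration:

```python
from collections import Counter

def diagnose_cbr(cases, query, k=3):
    """Retrieve the k most similar stored cases (similarity = size of
    feature overlap, a deliberately simple metric), combine their
    diagnoses by similarity-weighted voting, and report the winner's
    vote share as a confidence."""
    retrieved = sorted(cases, key=lambda c: -len(c["features"] & query))[:k]
    votes = Counter()
    for c in retrieved:
        votes[c["diagnosis"]] += len(c["features"] & query)
    total = sum(votes.values())
    best, weight = votes.most_common(1)[0]
    return best, weight / total if total else 0.0

CASES = [
    {"features": {"vibration", "overheat"}, "diagnosis": "bearing-wear"},
    {"features": {"vibration", "noise"},    "diagnosis": "bearing-wear"},
    {"features": {"overheat", "smoke"},     "diagnosis": "coolant-leak"},
]
print(diagnose_cbr(CASES, {"vibration", "overheat"}))
# ('bearing-wear', 0.75)
```

The retain step would simply append the solved query (with its confirmed diagnosis) to `CASES`, which is how the system learns from new situations.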
48. AI in Medicine
- The term (abbreviated AIM) was first coined in 1959, although actual usage didn't occur until the 1970s with Mycin
- Surprisingly, using AI for medical diagnosis has largely not occurred in spite of all of the research systems developed, in part because
- the expert systems impose changes to the way a clinician would perform their task (for instance, the need to have certain tests ordered at times when needed by the system, not when the clinician would normally order such a test)
- the problem(s) solved by the expert system is not a particular issue needing solving (either because the clinician can solve the problem adequately, or the problem is too narrow in scope)
- the cost of developing and testing the system is prohibitive
49. AIM Today
- So while AI diagnosis still plays a role in AIM, it is a small role, much smaller than those in the 1980s would have predicted
- Today, AIM performs a variety of other tasks:
- Aiding with laboratory experiments
- Enhancing medical education
- Running alongside other medical software (e.g., databases) to determine if inconsistent data or knowledge has been entered
- for instance, a doctor prescribing medication that the patient is known to be allergic to
- Generating alerts and reminders about specific patients to nurses, doctors, or the patients themselves
- Diagnostic assistance: rather than performing the diagnosis, these systems help the medical expert when the particular problem is a rare case
- Therapy critiquing and planning, for instance by finding omissions or inconsistencies in a treatment
- Image interpretation of X-rays, CAT scans, MRIs, etc.
50. AI Systems in Use
- Puff: interpretation of pulmonary function tests; has been sold to hundreds of sites world-wide starting as early as 1977
- GermWatcher: used in hospitals to detect in-patient acquired infections by monitoring lab culture data
- PEIRS: a pathology expert interpretive reporting system; it generates 80-100 reports daily with an accuracy of about 95%, providing reports on such things as thyroid function tests, arterial blood gases, urine and plasma catecholamines, glucose test results, and more
- KARDIO: a decision tree learning system that interprets ECG test results
- Athena: a decision support system that implements guidelines for hypertension patients to instruct them on how to be healthier, in use since 2002 in clinics in NC and northern CA
51. Continued
- PERFEX: an expert rule-based system to assist with medical image analysis for heart disease patients
- Orthoplanner: plans orthodonture treatments using rule-based forward and backward chaining and fuzzy logic, in use in the UK since 1994
- PharmAde and DoseChecker: expert systems to evaluate drug therapy prescriptions against the patient's background for inaccuracies, negative interactions, and adjustments, in use in many hospitals starting in 1996/1994
- IPROB: an intelligent clinical management system that keeps track of obstetrics/gynecology patient records and cases, providing risk reduction and decision support through distributed databases and rules based on hospital guidelines, practices, etc., in use since 1995