Anticipating the Good, the Bad, and the
Ugly: An Early Warning Approach for Conflict and Instability Analysis
Dr. Sean P. O’Brien
Center for Army Analysis
obrien@caa.army.mil
This document provides instructions for replicating the
principal analyses in the above referenced JCR article.
The multi-nomial logit analyses displayed at www.oocities.org/seanob88
were run using SPSS version 10.0 using the variables indicated at that website.
The Fuzzy Analysis of Statistical Evidence (FASE) estimates
were generated using a C++ program (version 1.2.2) developed by Dr. Yuan Yan
Chen (chen@caa.army.mil) at the Center
for Army Analysis, 6001 Goethals Road, Fort Belvoir, VA
22060-5230. Dr. Chen’s
program is available upon request and free of charge to anyone affiliated with
the United States government. It is available for a small fee to all others.
Please direct all inquiries and requests for the software to Dr. Chen at
the above address.
- The
FASE validation results can be replicated using the data included with this
documentation.
- Use
the country-year data for the period 1975-1999
- Sort
the data by year, then by country
- Place
the dependent variable in the third column of the spreadsheet (will
ultimately be in first column).
- Save
the Excel file with a name
- Replace
all missing values with –999
- Clear
the entire first row of everything (will contain the 3 input fields
specified in step 9)
- Delete
the first two columns (country and year)
- The
first cell in the upper left column contains the number of records to be
analyzed (3671); the second cell to the right contains the number of
variables to be analyzed (the dependent + all Independents=13); the third
cell to the right contains the number of categories on the dependent
variable (4).
- Save
that Excel file as a text file
- Execute
the FASE program
- Locate
the data text file using the “BROWSE” feature
- Specify
the proportion of records to be used for training in the % used for training
field.
·
The 1975-1984 training set used 1399 of the 3671 records or .3811
for training
·
The 1975-1989 training set used 2104 of the 3671 records or .5731
for training
·
The 1975-94 training set used 2876 of the 3671 records or .7834
for training
- For
these analyses, we used a window parameter of 1, a Frank parameter of .01,
and a confidence level of .8 (the default and an irrelevant parameter for
these analyses).
- Click
“GO.”
- Three
files are returned:
Cf.txt (ignore)
Class.txt
Output.dat
The class.txt file contains
possibility scores for each X variable observation associated with each class on
the dependent variable (useful for drawing curves).
The output.dat file contains the overall possibility scores associated
with each country-year’s macro-structural portfolio.
- Normalize
the possibility scores into probabilities
- Aggregate
the probabilities across the conflict event types (as described in the 6
decision rules in the JCR article) to derive the forecast level of intensity
of instability.
For additional detail, including a step-by-step
tutorial on FASE, see www.oocities.org/seanob88.