KEDS Data Sets
Most of the recent data sets use the standard .zip file compression format. To access the older sets with the .sit suffix, download the StuffIt Expander software.
Can't find the dyads you need here??
Check out the 10 Million International Dyadic Events (1990-2004) link at Gary King's data site. These data are provided by VRA, and Dale Thomas has written a Windows program to extract data from the file.
See the paragraphs below for a preview of each data category available on this website. You can also click on the pointers to learn more about each particular category.
CAMEO -- Conflict and Mediation Event Observations -- is the new coding scheme we have developed in conjunction with our current research on third-party mediation. CAMEO has several new features not found in the WEIS system we have used in our earlier work:
- The coding scheme is optimized for the study of mediation and contains a number of tertiary sub-categories specific to mediation
- We have substantially expanded the categories for "use of force" and can therefore make much finer distinctions between reported levels of violence
- We have combined a number of WEIS categories that, in our experience, cannot be reliably differentiated in machine coding.
- An extensive CAMEO code wiki (http://cameocodes.wikispaces.com/) has been implemented containing both the event and actor coding systems [NEW!].
Download CAMEO codebook (version 0.9b5)
Download CAMEO .verbs dictionary for TABARI (link will display the text of the file, which can be saved in your browser)
Download CAMEO Levant .actors dictionary for TABARI (includes nouns, adjectives and international actors)
Download TABARI .options file with CAMEO event labels
Last update: 18 July 2008
Most of the codes that are used in the data sets produced by the KEDS project are the standard WEIS codes originally developed by Charles McClelland (see "World Event/Interaction Survey (WEIS) Project, 1966-1978", ICPSR Study No. 5211) However, at various points we have experimented with introducing new codes into WEIS, borrowing most of these from the PANDA project. We assigned weights to the new codes that are comparable to the weights used in the Goldstein scale, and those weights are used in the aggregated data.
This data set is a compilation of almost thirty years of WEIS and CAMEO coded data specifically targeting events relating to states within the Levant. The raw data (1979-2009), a tab-delimited file of dyadic Goldstein-scaled totals (1979-2004), and coding dictionaries are available from this link.
This data set covers Turkey for the period 3 January 1992 to 31 July 2006 using the CAMEO coding scheme. It is based on Agence France Presse reports
This data set covers the states of the Gulf region and the Arabian peninsula for the period 15 April 1979 to 31 March 1999. The source texts prior to 10 June 97 were located using a NEXIS search command specifically designed to return relevant data.
These files contain WEIS-coded event data for an assortment of Central Asian states, including Afghanistan, Armenia, Azerbijan, Kazakstan, Kyrgistan, Tajikistan, Uzbekistan and Turkmenistan for the period May 1989 to July 1999. In addition to the lead-sentence coding, the "ALL" files include data retrieved from complete-story coding.
This data set contains WEIS-coded events for the major actors (including ethnic groups) involved in the conflicts in the former Yugoslavia from April 1989 through July 2003.
This data set contains WEIS-coded events for the major actors in West Africa from January 1989 through February 2002. The data was produced from full-story coding of Reuters articles. Most of the major opposition groups in the Liberian and Sierra leone civil wars are included in the data.
This data set contains about 5,000 records based on reports in international news sources of killings of five or more non-combatants anywhere in the world from January 1995 to the present. The data and the context of their collection are described in detail on the linked page and the codebook. The data are geo-coded and can be displayed using Google Earth; files for this can be downloaded from the linked page.
This data set contains tab-delimited files for third-party "mediation episodes" in the Levant (April 1979 - December 1998) and Balkans (June 1991 - May 1999). A mediation episode is defined as a specific mediator (e.g. USA or UN) meeting with both parties to a conflict within a period of a week; these are aggregated by month. These data were used in the paper Analyzing the Dynamics of International Mediation Processes in the Middle East and the former Yugoslavia (Deborah J. Gerner and Philip A. Schrodt).
These data sets were generated for the purpose of investigating interactions in regional conflicts. The project has traced events in several regional conflicts (each listed separately). The files are organized such that all files ending in .events contain event data as coded by KEDS. Files ending in .actors are the actor lists for each region. Files ending in .verbs are the verb patterns which code to a WEIS category. Files ending in .options and .class are KEDS preference files (described in the KEDS manual). Each is coded from Reuters lead sentences with date ranges and number of event given below.
This site contains older versions of various data sets and software. We are no longer working with these but they might have some utility in replication studies.