239x Filetype PDF File size 0.62 MB Source: files.stlouisfed.org
79
Richard G. Anderson and William G. Dewald
Richard G. Anderson is a researchofficer atthe Federal Reserve Bank
ofSt Louis, William C. Dewald is director ofresearchatthe Federal
Reserve Bank ofSt. Louis.
II Replication and Scientific
Standards in Applied Economics
a Decade After the Journal of
Money, Credit and Banking
Project
INCE EARLY1993, the Research Department an annotated version of the computer program
oftheFederalReserve Bank ofSt. Louis has made is prepared and all statistical results recalculated.
the data and programs for articles published in Finally, bibliographic and other references are
the Bank’s Reviewavailable to the public on its checkedby the analyst against original source
1 docmnents. We believe this practice both assures
electronic bulletin board. During the first year,
files from articles in the Review were downloaded the accuracy of the empirical results and allows
fromthe bulletin board more than 200 times. the interested reader to delve into the details
Morerecently, about 30 files havebeen down- of the author’s research.
loaded each month.
TheResearchDepartment ofthe Bank develops THE ROLE OF DATA IN ECONOMIC
the program and data files on our bulletin board EXPERIMENTS
during a replication of each article prior to pub- Although empirical knowledge inboth the
lication. A research analyst first checks the physical and social sciences arises from repeated
author’s data against original sources. Because experiments, therole of datadiffers. Inthe physical
databases may havebeen updated or revised sciences, scientists controla relatively smallnumber
after the research began, thiscart require searching of variables such as temperature, atmospheric
for the original published data. In a few cases, pressure, diet or family characteristics. Since
data errors have been corrected, fortunately with some variables are neither observed nor controlled,
only minorimpact on the author’s results. Next, no two repetitions of an experiment will be
Thebulletin board is advertised as the Federal Reserve Anderson (1986)summarizedtheJournalofMoney, Credit
Economic Database, orFRED. FRED’s phonenumber is and Bankingproject mentioned in the title,
(314) 621-1824. (TheFederalReserveSystem does not
have aserveron theInternet,) Dewald, Thursby and
NOVEMBER/DECEMBER 1994
80
identical. Response surface analysis and the newer The trepidation of authors aside, scientific
field of research synthesis providetools for ana- progress depends on challenging received wisdom.
lyzing the dependence of experimental results In applied economics, these challenges fall into
2
on the settings ofthe conditioning variables. three categories: replication of published results
In economics, however, unlike the physical using the previous authors’ data and programs;
sciences, researchers can only condition on the applying new statistical methods or techniques
observed values of the environmental variables, to authors’ datasets; and application ofexisting
not control them. Consider a simple model of statistical methods (including those used by
4
an economic experiment: previous authors) to new datasets. That most
applied economic research falls within the third
1. Form hypotheses. category is not surprising, since the first two
2. Collect data. depend on access toprevious authors’ datasets.
3. Develop theoretical and econometric Only with the authors’ data may the reader
repeat, or replicate, all five steps of the scientific
framework. experiment. Selecting a new set of values for the
4. Estimate. conditioning variables from published sources
5. Test hypotheses, draw conclusions. may yield results close to those obtained by
theauthor, or results that are quite different.
The values of the conditioning variables are Unfortunately, it is difficult to predict the sensi-
collected in step 2, Published articles typically tivity of authors’ results to variations in the values
describe steps 1, 3 and 5,but are most often silent ofthe conditioning variables. For an example
on step 2. In principle, a researcher armed with ofthe wide range of results that may arise when
the values ofthe conditioning variables and the mixing different sources and vintages of data, see
computer code for step 4 should be able to exactly thecomputer simulation experiment reported in
3 Dewald, Thnrsby and Anderson (1986),
reproduce an economic experiment. Unlike the
physical sciences, the experiment is deterministic,
given the data. THE JMCB PROJE:CT AF:TER 10 YEARS
Appraising the robustness of the results of TheJournal ofMoney, Credit and Banking
an economic experiment requires knowing the project, conducted from 1982-84 at the editorial
values of the conditioning variables used by the offices ofthe JMCB at The Ohio State University
researcher. Obtaining the data may sometimes in Columbus, was the first attempt by a profes-
be difficult. Datasets and programs may be mislaid sional journal to make authors’ programs and
or lost during the interval between completion of 5
data available to its readers. During the project,
the research andpublication ofan article. Further, the JIVICBasked authors to submit data and pro-
requests to authors for data may raise suspicions grams to the journal’s office. Conceptually, we
that the reader hopes (or expects) to find errors regarded the research reported in each article
in theauthors’ research. An individual researcher as the outcome ofan experiment. A complete
has strong incentives not to share data and pro- understanding of the experiment required the
grams. Ifthe materials are shared and results researchers’ data and computer programs, as
confirmed, the confirmation provides little (if any) well as the published summary descriptions and
reward to the researcher beyond the original conclusions. For a subset of these submissions,
publication of his findings. Ifresults are found weattempted torepeat steps 2 and 4bycollecting
faulty, however, the researcher faces the likeli- data from the sources cited by the authors and
hood of some professional embarrassment. re-running the authors’ computer programs.
2 SeeCooper andHedges(1994). Theprimary research team wasWilliam Dewald, Jerry
Acomplication notdealt with hereare errors and inconsis- Thursby, Richard Anderson and Hashem Dezbaksh. The
tencies in econometric computer programs. In the Journalof project’sfindings are summarized in Dewald, Thursby and
Money, Credit andBanking project (described furtherbelow), Anderson (1986). The projectwas supported in part by the
we requestedthat authors providethe version, release date National Science Foundation,
andserial number of the computer program used fortheir
estimation. SeeLovell and Selover (1994) for examples of
the variation in econometric packages.
Various classification schemes and nomenclatures have
beendiscussed by Kane (1984), M’ittelstaedt andZorn (1984),
Hubbard andVetter (1991), Lindsay and Ehrenberg (1993),
and Fuess (1994).
FEDERALRESERVE BANKOF ST. LOUIS
81
concluded that it is important for journals to
IMe request data from authors immediately following
completion of the research, and for the journal
Dets forthe4*408Pr ject to retain the data to avoid its loss during the
interval between completion ofthe research
aalasesnaqueateay MolttttwøYear and publication of the paper.
throug M*y1982 In the second part of theJMCB project, we
198344 as ~ ~ 88 69 studied whether the materials submitted by
Js~ ~ 1 authors were in fact sufficient to allow another
eb. 211 S 64 researchertorepeattheirexperiment.
Mefeh G 9\ 4 t I Formany articles, repeating step 2—searching
AM I 28 ‘1 1
0! 4 11 2 for the authors’ data in their stated sources—was
June 1 G 62impossible.Descriptionsofsourceswereeither
S 4 s too vague to allow us to locate the data and/or
Aug 4 G 2 the datawere not included in the cited sources.
Sep 442! 1 Although 54 datasets were submitted tothe
JMCB during the project, we judged only eight
Get 1 824assatisfactoryand14asvaluelessinhelpingus
Nov St understand the authors’ published work. Others
Dec 0 1 8 4 were deficient in at least one important respect.
For a few articles, we discovered data errors
asesetsAvanmbtnylasuaakàtMay49* during comparison topublished sources. In the
Ba s~s ass a as most severe case, we found that an author’s con-
Feb 6 4 6 2 clusions were reversed (prior to publication of
Ma 2 6 the article) when an error was corrected. Where
Agg~ 3 8 2 the data were adequate, we usually obtained
14ev 4 577 ~1 S numerical results from authors’ programs very
close to those reported by the authors.
Totat S 15 22424 17 1* ‘14 17 Beyond encouraging readers to explore authors’
‘N methodology and the robustness of published
501)140 JMOIð thtdfte,steff N t~ata~ts results, we believed that requesting data and
aysabe cmtheJMCastaff programs from authors would encourage them
to exercise added care during their research.
Wefound during the JMCB project that many Wealso expected that other journals would adopt
authors could not furnish data and program similar requirements toincrease the value oftheir
following publication of their articles. We ini- articles to readers. Although the JMCB project
tially requested data from the authors of 62 articles stimulated discussion of the role of replication
published during 1980-82, prior to the beginning in economics, no other journal adopted a policy
ofthe project on July 1, 1982. About one third of requesting data from authors during the I 980s.
ofthe authors did not respond toeither a first or Somejournals adopted editorial statements that
second request for data. Among the responding authors should stand ready to provide data and
authors, one-half either could not locate their programs to other researchers. Such statements,
data or chose not to submit them. Most of these in and ofthemselves, maynot solve the two major
authors said that they could have done so if the problems identified during the project: Dataoften
materials had been requested when the manu- are mislaid prior to publication of the article, and
script was first submitted to the JMCB. Data and the author may he suspicious of the motives of a
programs were often apparently mislaid during researcher requesting his data and programs.
the relatively long delay between completion of A decade after theJMCBproject, the replication
the research and publication of its findings. of previous studies as a part ofnew research seems
Wenext requested data from the authors of an infrequent occurrence. During the last decade,
papers that either had only recently been accepted no papers or notes in the JMCBhave focused
for publication or were under editofal review primarily on replication, and only about two
More than three-fourths furnished their data. We papers per year have included a direct compari-
NOVEMBER/DECEMBER1994
82
sonofthe authors’ results tothose in previous a much larger number of journals. The editors
studies, whether published in theJMCB or else- of22 journals, however, declined invitations
where.~Thecollection of databy professional fromNSF’s economics program to request that
0
journals also remains rare. The JMCB discontin- their authors place data in the ICPSRarchive.
ued requesting data from authors in 1993. To The National Science Foundation has also
our knowledge, today only two journals—the adopted guidelines to reduce the cost of replica-
Journal ofApplied Econometrics and theJournal tions. The guidelines require that authors place
ofBusiness andEconomic Statistics—routinely any data used and/or developed in conjunction
request data from authors, and neither requests with an NSF-funded project in a public archive
their programming. not later than six months following the end of
FromJanuary 1983 through mid-1989, theJMCB the grant period. Applications for additional
received nearly 150 submitted datasets and about NSF funds must contain a statement ofhow the
300 requests, as shown in Table i.7 Except for a author has complied with this requirement.
surge in requests following the publication of TheICPSR accepts data from any author who
Dewald, Thursby and Anderson(1986), on balance has received NSF funds.°
only a few datasets were requested each month National Science Foundation initiatives have
eventhough the number ofavailable datasets also assisted users of copyrighted and confidential
increased significantly during the decade, data. Somedata obtained by researchers from
The higher request rate during the past two commercial vendors are copyrighted and may
years for data from the St. Louis bulletin board not be further distributed by the researcher
may suggest that the modest costs ofrequesting without the vendor’s permission. One such
data from theJMCB still exceeded the marginal vendor, the Centerfor Research in Security Prices
valueto an individual researcher of replicating (CRISP), has agreed to maintain researchers’
a previous study. To obtain data fora]MCB article, datasets as part of its own database and make
a readerhad to call the editorial office to ask the them available to all licensed users of its data.
priceofthe data, submit payment by mail, wait for Forconfidential data, the Bureau of the Census
thedatatobereproduced and mailed, and perhaps and the NSF are exploring opening regional
re-enter the data into a computer. By contrast, offices that would allow researchers access to
the St. Louis bulletin board is free, delivery is confidential data, including datasets used in
immediate, and data are machine-readable. previously published studies. A pilot office
is operating in Boston.
THE ROLE OF THE NATIONAL REPLICATION OF THE JMGB PROJECT
SCIENCE FOUNDATION IN ST. LOUIS
The economics program of the National Ourexperience at the JM~JBduring 1982-84
Science Foundation (NSF) has sought to build was itself only onetrial of an experiment. Would
a heightened awareness of the value of data another sample of authors also have difficulty
collection, archival and distribution among providing data following publication of their
economists during the last decade. Following articles? Or were our original findings anom-
publication of Dewald, Thursby and Anderson alous, leading us to greatly exaggerate the
(1986), the NSF established an archive for the problem, as some critics have suggested?
storage and distribution of authors’ data at the
Inter-university Consortium for Political and During 1992-93, we repeated theJMCBexperi-
Social Research (ICPSR) at the University of ment at St. Louis. in part, by requesting data and
Michigan. Initially, some anticipated that the programs from the authors ofpapers presented
NSF’s effort would extend the JMCB’s practice at the Bank’s annual economic policy conference
ofrequesting and distributing authors’ data to in October 1992. We did not tell authors prior
Replication also has been relatively rare even in journals o Materialsshould be submitted to User Support, ICPSR, P.O.
that encourage submission of such papers- See Fuess Box 1248, Ann Arbor, MI 48106. Data may be retrieved
(1994). from ICPSR via an Internet server; see Goffe (summer
Recall that in 1982 we began requesting data for articles 1994; March 1994).
that had been published asearly as 1980.
o Some authors havesince proposed models ofhow such col-
lective disinterest among professionaljournals might arise
and be sustained. See Feigenbaum and Levy (1994, 1993),
FEDERALRESERVE BANKOF ST. LOUiS
no reviews yet
Please Login to review.