[OAI-eprints] Re: The RePEc (Economics) Model
Thomas Krichel
krichel@openlib.org
Wed, 19 Mar 2003 22:16:33 +0200
Stevan Harnad writes
> The Repec model is one in which many distributed institutions,
> each having archives of multiple economics papers of
> their own, have their metadata gathered together and
> enriched to provide OAI-like interoperability: http://repec.org/
The interoperability is more complicated than in a conventional
OAI setting, because the structure of the data exchanged goes well
beyond what can be done with oai_dc.
> Instead of using the OAI protocol, Repec uses the "Guildford"
> protocol -- ftp://netec.mcc.ac.uk/pub/NetEc/RePEc/all/root/docu/guilp.html --
> but it has been announced that Repec plans to become OAI-compliant
> eventually.
I already operate a gateway at http://oai.repec.openlib.org. Its
oai_dc data may be a bit thin, but there is plenty of AMF metadata.
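
For illustration only, here is a minimal sketch of how a harvester might
ask such a gateway for records in either format. It assumes the gateway
answers OAI-PMH requests at its root URL and that the AMF data is exposed
under the metadataPrefix "amf"; neither detail is stated above, and the
function name is made up for the example.

```python
from urllib.parse import urlencode

# Assumption: the gateway answers OAI-PMH requests at its root URL.
BASE_URL = "http://oai.repec.openlib.org"


def list_records_url(metadata_prefix, base_url=BASE_URL):
    """Build an OAI-PMH ListRecords request URL for one metadata format."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return f"{base_url}/?{query}"


# oai_dc is the format every OAI-PMH repository has to support.
dc_url = list_records_url("oai_dc")

# Assumption: "amf" is the prefix under which the richer AMF data is served.
amf_url = list_records_url("amf")

print(dc_url)
print(amf_url)
```

The two requests differ only in the metadataPrefix argument, which is the
point: the transport stays the same while the metadata model changes.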
> (Repec does *not*, as I had wrongly assumed, cover individual
> websites too, as ResearchIndex/citeseer
> http://citeseer.nj.nec.com/cs does, only multi-paper institutional
> archives.)
Departmental archives, as distinguished from institutional archives.
Some archives serve special purposes; they hold no document
data at all.
> Repec is accordingly a form of institutional self-archiving,
> pre-dating the OAI, but (1) focused on one discipline only
> (economics), and (2) not requiring the individual archives to be
> OAI-compliant (but Guildford-compliant).
Correct. Guildford compliance is basically just a way to dump files
on a disk, nothing more.
> It is a very activist project, "a collaborative
> effort of over 100 volunteers in 30 countries to enhance the dissemination
> of research in economics."
Correct, and almost all are economics faculty. Some folks do
little, but the construction of the whole enterprise means that
even if they do little, since there are many
> It should be noted at once that if every discipline had its own
> institutional Guildford-compliant archives and volunteers, as Economics
> has, then I and many others would today be promoting Institutional
> Guilford-compliant repositories rather than Institutional OAI-compliant
> repositories (and the free software that Southampton designed for creating
> OAI-compliant institutional repositories for self-archiving
> http://www.dlib.org/dlib/october00/10inbrief.html would have
> been Guildford-compliant software).
The technical protocol for the transport matters little. This
really (!) is a technical matter. We continue with what we have
got because we cannot rearrange 250+ archives that otherwise
do just fine.
> What distinguished Repec is hence not its interoperability protocol
> (since it plans to become OAI-compliant anyway) but (a) its activism
> and (b) its discipline-specificity.
and (c) its metadata model. This is by far the most important, but least
well understood distinction.
> If there were a way to spread Repec's activism from economics to the
> other disciplines, it would certainly be very welcome, just as it
> would be very welcome if there were a way to spread ArXiv's
> central-archiving tendency to the other disciplines.
Could not agree more.
> Unfortunately, no such generalization of either Repec or Arxiv to the
> other disciplines has taken place (Repec began in 1997, Arxiv in 1991).
RePEc has its origin in a project called WoPEc that I started on
February 1, 1993. In 1997, RePEc was born essentially out of WoPEc
and some other partners, but WoPEc had the lion's share (I am
simplifying here a bit.)
> http://www.earlham.edu/~peters/fos/timeline.htm It is for this
> reason that it is OAI-compliant institutional self-archiving that I
> happen to be promoting. And this is at last showing signs of
> generalizing http://www.ecs.soton.ac.uk/~harnad/Temp/tim-arch.htm
> though still not fast enough. It is for that reason that various
> forms of activism need to be promoted too, especially institutional
> activism:
There is no contradiction between institutional and departmental
archives, and aggregator structures. It is by no means an either/or
choice. And let me emphasise again: having discipline-based
aggregators will be the best way to stimulate institutional
and departmental archiving. The problem is, of course, that
there are not many aggregators around. Therefore I have been
arguing for a while that the institutional self-archiving
community should stick together to elect one area of disciplinary
priority. That is, rather than fighting a war on all fronts,
concentrate the effort and build systems that are interoperable
beyond the unqualified DC data model. The DC data model is too simple
for academic self-documentation.
> At first, FTP sites and Web sites seemed the simplest, fastest and
> most direct way for researchers to self-archive, on a distributed,
> institutional basis;
They still are, just look at the amount of stuff that is on the
web. There are so many grass-roots initiatives. The larger
public is not aware of them because they serve specific communities.
This is where I get so angry with Clifford and his---implicit---call
to shut them down, to fit all publishing activities into a central
straightjacket.
> but then the slow progress in this, and the success of the
> physicists' centralized disciplinary model suggested that
> centralized, discipline-based self-archiving might be faster, with
> the Physics Arxiv itself perhaps subsuming it all
> http://cogprints.soton.ac.uk/documents/disk0/00/00/16/99/ (Thomas
> Krichel argued against central archiving,
Nope. I simply argued that the centralized model would not
carry through to many disciplines. Where it worked it
was certainly an extremely good model. But you insisted
that because the Physicists had done it everyone could
and would, it was the optimal way (your flavour of the day).
But I am still right. arXiv has a very unequal distribution
of papers even in sub-areas of Physics, I am told. Ebs will
know better. arXiv is still growing and that is a good thing.
> But central archiving did not catch on (Cogprints has only reached
> 1500 papers in 2003) or generalize to other disciplines,
Exactly as I had forecasted! And that, despite the fact that
it was a project subsidized by public funds. When WoPEc became
a funded project, by the same funders, it had around 5,000
papers accumulated as a labor of love, only. Much of that
work was done by José Manuel Barrueco Cruz.
> and Arxiv itself kept growing at only an unchanged linear rate from
> year to year: http://arxiv.org/show_monthly_submissions
Sure, but it still is the finest self-archiving project on the planet.
But it really is self-archiving. Self-archiving is only a part
of what I call self-documentation.
> And then came the OAI protocol in 1999, making distributed
> self-archiving equivalent to central (because of interoperability)
> http://www.openarchives.org/documents/index.html
They are not quite equivalent, but that is a matter for another email...
> which immediately prompted me to ask Rob Tansley to redesign the
> Cogprints software to make it OAI-compliant and then turn it into
> free generic OAI archive-creating software for institutions
> http://www.dlib.org/dlib/october00/10inbrief.html
And I think your team are doing a very good job with this.
> I think I now understand this. See above. Both Repec's
> aggregation of institutional multi-paper archives in economics and
> Citeseer/ResearchIndex's harvesting of arbitrary individual websites
> in computer science
Citeseer are a truly fab project. The material that is there
should become part of a new, RePEc-like data structure called
rclis and pronounced "reckless". Watch out for it over the
next few years.
With greetings from Minsk, Belarus,
Thomas Krichel http://openlib.org/home/krichel
RePEc:per:1965-06-05:thomas_krichel