[CollEc] Network Dataset
Thomas Krichel
krichel at openlib.org
Fri Jan 18 03:33:40 UTC 2019
Marina Azzimonti wrote:
> Alessandra Fogli, Veronica Guerrieri and I are organizing a "Women in
> Macro" conference. We did one last year and besides talking about
> economics, we like to have a short session to discuss gender issues in
> general. One topic that often arises is that women have a harder time
> "networking" than men do.
>
> One way to test this theory is to compare network characteristics of males
> and females. I noticed you guys at RePEc (and Thomas at CollEc) have
> constructed a nice dataset which would allow us to check this. From what I
> see online, for each person on RePEc/IDEAS you compute closeness and
> betweenness. Of course one would like to control for characteristics such
> as time since graduation, and perhaps sort by "quality". I see that it is
> possible to download a lot of this online, but I was wondering if you had
> an organized dataset already constructed you could share with me. This
> would save us a ton of time.
>
> What I think we would need is:
>
> - Name and Last name
> - RePEc id
> - Affiliation
> - PhD year
> - Location (that is, are they US or not, etc)
> - Network measures: closeness and betweenness (rank and value)
> - Gender
> - Various ranking measures
> - # of publications
> - fields
> - this is a stretch: list of coauthors
>
> I am not sure if there is any other information available, but this is what
> I see appears online.
>
> I could write a script to get the data from the site, but only the top 10%
> is available I believe.
>
> Anyways, let me know if this (or a subset of this) would be feasible. I
> think if we find something interesting it would be good to promote the
> ideas and CollEc datasets. Not many people know about the network data and
> I can foresee a lot of interesting applications with it.
I can deliver the full CollEc data via rsync. I just set it up:
krichel@trabbi~/collec$ rsync -va rsync://collec.repec.org/table collec.txt
receiving incremental file list
created directory collec.txt
./
CollEc.txt
sent 46 bytes received 8,097,988 bytes 2,313,724.00 bytes/sec
total size is 8,095,907 speedup is 1.00
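For a first look at the downloaded file, a minimal sketch follows. The helper name `preview_table` is hypothetical, and the tab delimiter is an assumption, since the layout of CollEc.txt is not described in this message.

```shell
# Hypothetical helper: print the first lines of the synced table and
# count the fields on its first line.
# ASSUMPTION: fields are tab-separated; adjust -F if they are not.
preview_table() {
    file=$1
    lines=${2:-5}    # number of lines to show, default 5
    head -n "$lines" "$file"
    awk -F '\t' 'NR == 1 { print "fields on line 1: " NF }' "$file"
}

# Usage, with the directory created by the rsync call above:
# preview_table collec.txt/CollEc.txt
```

If the field count comes back as 1, the file most likely uses a different delimiter.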
This is tabular data. It is updated every day. I already deliver the
full path data to Nikos. It is bulky:
icanis@katri:~/icanis/paths/ras$ du -s biwe
140757804 biwe
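To put that figure in more familiar units, here is a small sketch. It assumes `du` reported 1 KiB blocks, which is the GNU coreutils default when no block-size option or environment variable is set; the variable names are illustrative only.

```shell
# Size printed by `du -s biwe`.
# ASSUMPTION: the unit is 1 KiB blocks (GNU du default).
kib=140757804
# Integer division: KiB -> MiB -> GiB, rounded down.
gib=$((kib / 1024 / 1024))
echo "biwe is roughly ${gib} GiB"
```

Under that assumption the path data comes to roughly 134 GiB, compared with about 8 MB for the table.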
If you make it to the city before March, we could meet. Or
if you can pay for my LIRR tickets, I can come to Stony Brook
and give a talk about this work.
--
Cheers,
Thomas Krichel http://openlib.org/home/krichel
skype:thomaskrichel