[CollEc] Network Dataset

Thomas Krichel krichel at openlib.org
Fri Jan 18 03:33:40 UTC 2019


  Marina Azzimonti wrote:

> Alessandra Fogli, Veronica Guerrieri and I are organizing a "Women in
> Macro" conference. We did one last year and besides talking about
> economics, we like to have a short session to discuss gender issues in
> general. One topic that often arises is that women have a harder time
> "networker" than men do.
> 
> One way to test this theory is to compare network characteristics of males
> and females. I noticed you guys at RePec (and Thomas at CollEC) have
> constructed a nice dataset which would allow to check this out. From what I
> see online, for each person on RePEC/Ideas you compute closeness and
> betweenness. Of course one would like to control for characteristics such
> as time since graduation, and perhaps sort by "quality". I see that it is
> possible to download a lot of this online, but I was wondering if you had
> an organized dataset already constructed you could share with me. This
> would save us a ton of time.
> 
> What I think we would need is:
> 
> - Name and Last name
> - Repec id
> - Affiliation
> - PhD year
> - Location (thta is, are they US or not, etc)
> - Network measures: closeness and betweenness (rank and value)
> - Gender
> - Various ranking measures
> - # of publications
> - fields
> - this is a stretch: list of coauthors
> 
> I am not sure if there is any other information available, but this is what
> I see appears online.
> 
> I could write a script to get the data from the site, but only the top 10%
> is available I believe.
> 
> Anyways, let me know if this (or a subset of this) would be feasible. I
> think if we find something interesting it would be good to promote the
> ideas and CollEc datasets. Not many people knows about the network data and
> I can foresee a lot of interesting applications with it.

  I can deliver the full CollEc data via rsync. I just set it up

krichel at trabbi~/collec$ rsync -va rsync://collec.repec.org/table collec.txt
receiving incremental file list
created directory collec.txt
./
CollEc.txt

sent 46 bytes  received 8,097,988 bytes  2,313,724.00 bytes/sec
total size is 8,095,907  speedup is 1.00

  This tabular data.  It updated every day.  I already deliver full
  path data to Nikos. It's bulky

icanis at katri:~/icanis/paths/ras$ du -s biwe
140757804       biwe

  If you make it to the city before Mach, we could meet. Or
  if you can pay my LIRR tickets, I can come to Stony Brook
  and give a talk about this work. 


-- 

  Cheers,

  Thomas Krichel                  http://openlib.org/home/krichel
                                              skype:thomaskrichel



More information about the CollEc-run mailing list