[ArchEc] plid stats

Thomas Krichel krichel at openlib.org
Sat Dec 12 14:37:29 UTC 2020


  I just wrote a few lines of Python to summarize the plind:

archec@darni:$ plind_stats
1071893 papers
1447873 PDF payloads
1226498 plodis

  The plodi is a payload digest. So the plodi count tells you how many
  different payloads we have. My policy is to duplicate payloads if
  they belong to different papers. While this wastes disk space,
  anything else would make it harder for consumers of the data.
  Having hit over a million on all figures is good; it should
  make the funders happy.
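
  To illustrate how the three figures relate, here is a minimal sketch.
  It is not the actual plind_stats script. The dict layout of the plind,
  the names Stats, plodi() and summarize(), and the use of SHA-256 as
  the digest are all assumptions made for the example.

```python
import hashlib
from collections import namedtuple

# Hypothetical result record; the field names mirror the three output lines.
Stats = namedtuple("Stats", "papers payloads plodis")


def plodi(payload: bytes) -> str:
    # Assumption: SHA-256. The digest actually used in ArchEc is not stated.
    return hashlib.sha256(payload).hexdigest()


def summarize(plind: dict) -> Stats:
    """Count papers, payloads and distinct payload digests.

    `plind` is a hypothetical mapping of paper id -> list of payload bytes.
    """
    payload_count = 0
    digests = set()
    for payloads in plind.values():
        for payload in payloads:
            payload_count += 1
            digests.add(plodi(payload))
    return Stats(len(plind), payload_count, len(digests))


# Two papers share one identical payload, so it is stored twice
# but contributes only one plodi.
example = {
    "paper-a": [b"%PDF-1.4 first", b"%PDF-1.4 second"],
    "paper-b": [b"%PDF-1.4 first"],
}

stats = summarize(example)
print(f"{stats.papers} papers")
print(f"{stats.payloads} PDF payloads")
print(f"{stats.plodis} plodis")
```

  With duplication across papers, the payload count is always at least
  the plodi count. The gap between the two is the number of redundant
  copies kept on disk.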
   
-- 

  Cheers,

  Thomas Krichel                  http://openlib.org/home/krichel
                                              skype:thomaskrichel


