[CollEc] helos overload

Düben, Christian christian.dueben at uni-hamburg.de
Sun Jul 30 13:12:42 UTC 2023


I added a cron job that removes crashed app containers:

*/30 * * * * docker ps --filter "ancestor=collec_app" --filter "status=exited" | xargs docker rm
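
A more defensive variant of this cleanup could live in a small script that cron calls. This is only a sketch under assumptions: the function name `remove_exited` and the script layout are hypothetical, and `collec_app` is the image name taken from the cron line above. `docker ps -q` prints bare container IDs rather than the full table, so no header or column text reaches `docker rm`, and the emptiness check avoids calling `docker rm` with no arguments.

```shell
#!/bin/sh
# Hypothetical helper (not part of CollEc): remove exited containers
# that were started from the given image.
remove_exited() {
    image="$1"
    # -a lists stopped containers as well, -q prints only container IDs
    ids=$(docker ps -aq --filter "ancestor=$image" --filter "status=exited")
    # Nothing to do when no exited container matches
    [ -z "$ids" ] && return 0
    # $ids is intentionally unquoted so that each ID becomes its own argument
    docker rm $ids
}

remove_exited collec_app
```

Cron could then run such a script on the same half-hourly schedule as the entry above.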

I also stopped and removed the running app containers. I did not add that as a cron job, since it would also abort legitimate user sessions.

Robots continue to be a problem. The current system is not well suited to distinguishing robots from human users. This should be managed at the middleware level, which I do not develop myself.

Robots will be much less of an issue with CollEc 3.0, though, as you know, I will only work on that towards the end of the year.

Christian Düben
Doctoral Candidate
Chair of Macroeconomics
Hamburg University
Germany
christian.dueben at uni-hamburg.de
https://www.christian-dueben.com

-----Original Message-----
From: CollEc-run <collec-run-bounces at lists.openlib.org> On Behalf Of Thomas Krichel
Sent: Sunday, 30 July 2023 14:09
To: CollEc Run <collec-run at lists.openlib.org>
Cc: spider-feedback at bytedance.com
Subject: Re: [CollEc] helos overload

  Thomas Krichel writes

> Now I will reboot helos.

  The reboot was a success, but Bytespider is still there.

krichel at trabbi~$ GET https://collec.repec.org/robots.txt
# makes requests faster than we can handle
User-agent: Bytespider
Disallow: /
# Crawl-delay: 10

 
-- 
  Written by Thomas Krichel http://openlib.org/home/krichel on his 21240th day.

_______________________________________________
CollEc-run mailing list
CollEc-run at lists.openlib.org
http://lists.openlib.org/cgi-bin/mailman/listinfo/collec-run