just a heads up to anybody who does not want to be part of yet another “research project” into fediverse, in this case defederations
these people are attempting to scrape the /timelines/local and /api/v1/timelines/public routes to scrape public data. if you do not want these, you can block their user agent in nginx/cloudflare/etc. we already prevent access to /timelines/* by unauthenticated users so this wasn’t an issue for us. here is a link to the project and their ip/user agents from our logs
https://wiki.communitydata.science/CommunityData:Fediverse_research
129.105.31.75 - - [25/Oct/2023:02:20:39 +0000] "GET /api/v1/timelines/public?limit=1 HTTP/2.0" 401 52 "-" "https://wikicommunitydata.science/CommunityData:Fediverse_research" 129.105.31.75 - - [25/Oct/2023:02:20:45 +0000] "GET /api/v1/timelines/public?limit=1 HTTP/2.0" 401 52 "-" "https://wikicommunitydata.science/CommunityData:Fediverse_research" 129.105.31.75 - - [25/Oct/2023:02:20:52 +0000] "GET /api/v1/timelines/public?limit=1 HTTP/2.0" 401 52 "-" "https://wikicommunitydata.science/CommunityData:Fediverse_research" 129.105.31.75 - - [25/Oct/2023:02:20:58 +0000] "GET /api/v1/timelines/public?limit=1 HTTP/2.0" 401 52 "-" "https://wikicommunitydata.science/CommunityData:Fediverse_research" 129.105.31.75 - - [25/Oct/2023:02:21:03 +0000] "GET /api/v1/timelines/public?limit=1 HTTP/2.0" 401 52 "-" "https://wikicommunitydata.science/CommunityData:Fediverse_research"