NewsWorld
PredictionsDigestsScorecardTimelinesArticles
NewsWorld
HomePredictionsDigestsScorecardTimelinesArticlesWorldTechnologyPoliticsBusiness
AI-powered predictive news aggregation© 2026 NewsWorld. All rights reserved.
Trending
CrisisInfrastructureStrikesIranTrumpNuclearFebruaryNewsMilitaryReachedLimitedDigestTimelineTrump'sDaysAnnounceDailyTariffsProtestsGreenlandChallengeEuropeanLongevityEmergency
CrisisInfrastructureStrikesIranTrumpNuclearFebruaryNewsMilitaryReachedLimitedDigestTimelineTrump'sDaysAnnounceDailyTariffsProtestsGreenlandChallengeEuropeanLongevityEmergency
All Articles
Facebook's Fascination with My Robots.txt
Hacker News
Published about 5 hours ago

Facebook's Fascination with My Robots.txt

Hacker News · Feb 23, 2026 · Collected from RSS

Summary

Article URL: https://blog.nytsoi.net/2026/02/23/facebook-robots-txt Comments URL: https://news.ycombinator.com/item?id=47121210 Points: 28 # Comments: 6

Full Article

For the past 4 days — and probably more since I don't have logs beyond that — Facebook has been hitting the /robots.txt of my self-hosted Forgejo instance several times per second. The user-agent is facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php). I expected the UA header to be nontrustworthy, but all the requests are also coming from Meta's IP address ranges. The interesting thing is that no other file is being accessed. Just robots.txt over and over and over again. Facebook's documentation states: The primary purpose of FacebookExternalHit is to crawl the content of an app or website that was shared on one of Meta’s family of apps, such as Facebook, Instagram, or Messenger. The link might have been shared by copying and pasting or by using the Facebook social plugin. This crawler gathers, caches, and displays information about the app or website such as its title, description, and thumbnail image. Now, as tempting as it is to think that I've suddenly reached unfathomable levels of popularity on Meta's platforms, I find it difficult to believe as the only other traffic on my instance are the AI bots consistently crawling the qmk_firmware repository and the very occasional user of one of my Hex packages. And myself. Not even Facebook themselves are requesting any other path at the moment, just robots.txt. Here's the accesses I'm getting, visualised in two ways for your convenience: This chart provided by my extreme LibreOffice Calc skillz. Data is grouped by hour. Click the image to open in full size. [Insert Matrix quote here.] So what's going on at Meta? Why are they so obsessed with my very bog standard robots.txt file? I'm a nobody and surely not interesting enough that they'd only be targeting me specifically, so how much bandwidth and energy are they using globally to mass request robots.txt files in a never ending loop? Perhaps someone at their end screwed up a loop conditional, but you'd think some monitoring dashboard somewhere would have a warning pop up because of this. Anyway, compared to the earlier AI bot onslaught, this traffic is mostly benign for myself, just interesting. As long as it doesn't continue picking up speed.


Share this story

Read Original at Hacker News

Related Articles

Hacker Newsabout 2 hours ago
NASA uses Mars Helicopter's SoC for rover navigation upgrade

Article URL: https://www.theregister.com/2026/02/23/perseverance_rover_soc_navigation_upgrade/ Comments URL: https://news.ycombinator.com/item?id=47123321 Points: 7 # Comments: 1

Hacker Newsabout 2 hours ago
The peculiar case of Japanese web design

Article URL: https://sabrinas.space Comments URL: https://news.ycombinator.com/item?id=47122789 Points: 66 # Comments: 20

Hacker Newsabout 3 hours ago
The Age Verification Trap, Verifying age undermines everyone's data protection

Article URL: https://spectrum.ieee.org/age-verification Comments URL: https://news.ycombinator.com/item?id=47122715 Points: 207 # Comments: 136

Hacker Newsabout 3 hours ago
How in the Hell Did Joann Fabrics Die While Best Buy Survived? It Wasn't Amazon

Article URL: https://www.governance.fyi/p/how-in-the-hell-did-joann-fabrics Comments URL: https://news.ycombinator.com/item?id=47122337 Points: 5 # Comments: 2

Hacker Newsabout 4 hours ago
VTT Test Donut Lab Battery Reaches 80% Charge in Under 10 Minutes [pdf]

Article URL: https://pub-fee113bb711e441db5c353d2d31abbb3.r2.dev/VTT_CR_00092_26.pdf Comments URL: https://news.ycombinator.com/item?id=47121864 Points: 23 # Comments: 18

Hacker Newsabout 4 hours ago
femtolisp: A lightweight, robust, scheme-like Lisp implementation

Article URL: https://github.com/JeffBezanson/femtolisp Comments URL: https://news.ycombinator.com/item?id=47121539 Points: 39 # Comments: 5