Perplexity AI exposed for stealthily scraping the web, dodging no-crawl rules

Update: 2025-08-05

Description

Cloudflare exposes Perplexity AI’s stealth crawling tactics

Perplexity’s crawlers bypass common no-crawl directives (robots.txt) by switching from declared bot user agents to generic browser strings, primarily mimicking Chrome on macOS.

When blocked, Perplexity rotates IP addresses and ASNs outside their official ranges to evade detection, violating ethical web crawling norms.

Cloudflare’s tests with private domains blocking all crawlers still showed Perplexity returning detailed data, indicating covert scraping.

Cloudflare responded by delisting Perplexity as a verified bot and deploying managed rules—available even on free plans—to detect and block these evasive crawlers.

The case highlights tensions between AI companies’ aggressive data harvesting for training and the web ecosystem’s control measures, underscoring the need for transparent bot behavior standards.

“Objects should shut the fuck up” — critique of excessive device noise

Modern consumer products like cars, washing machines, and baby monitors produce intrusive, often unnecessary audible alerts with minimal user control or configurability.

Examples include persistent, startling LPG warnings in cars and non-disableable beeps on every washing machine control interaction, increasing user annoyance and potentially reducing safety.

The author’s frustrated tone underscores widespread alert fatigue caused by default sounds that prioritize notifications over user context or wellbeing.

Exceptions praised are devices with subtle, considerate alerts, such as dishwashers opening their doors silently after cycles or silent e-readers.

This calls for design philosophies that prioritize user control and reduce noise pollution in everyday technology.

Could interstellar object 3I/ATLAS be alien technology?

Researchers analyzed the recently discovered 3I/ATLAS’s unusual orbital dynamics and non-gravitational acceleration, hypothesizing it might be a technological artifact with possible intelligence and intent.

The object’s orbital tilt and trajectories near inner planets are statistically improbable for random interstellar visitors and could enable stealthy Solar System access.

The paper entertains the idea of a “Dark Forest” scenario where advanced civilizations might behave hostilely, suggesting 3I/ATLAS could be benign or malign.

The authors treat the hypothesis primarily as a pedagogical exercise, emphasizing the importance of scientific openness to such testable but speculative ideas.

The study provokes debate on interpreting limited data about interstellar visitors and the implications for SETI and planetary defense.

ChatGPT in university writing classes: a year-long experiment

UVA professor Piers Gelly integrated ChatGPT use into his writing curriculum, tasking 72 students to critically engage AI tools rather than banning them.

Students viewed AI skeptically yet pragmatically, using it for brainstorming and editing while recognizing its tendency toward bland and hallucinated content.

Classroom discussions highlighted differences between AI-generated “romanticized” prose and more mundane human writing, sparking reflection on storytelling and creativity.

Faculty found AI useful for grading speed and assignment design, though students largely preferred human feedback; most agreed human instructors remain essential.

The experiment illustrates a nuanced “messy middle” where human creativity and AI support coexist, suggesting collaborative rather than adversarial futures in education.

Comments

In Channel

Stanford doubles down on legacy admissions, ditching Cal Grant funds to keep donor perks alive

2025-08-1014:22

OpenAI sparks uproar by abruptly retiring GPT-4o, but Sam Altman pledges its return for Plus users

2025-08-0911:45

OpenAI launches GPT-5, a smarter, faster AI expert team ready to revolutionize coding and work efficiency

2025-08-0815:09

Kitten TTS: Ultra-lightweight, offline text-to-speech for any device

2025-08-0715:40

OpenAI launches open-weight GPT-OSS models rivaling proprietary LLMs with full customization and chain-of-thought transparency

2025-08-0615:22

Perplexity AI exposed for stealthily scraping the web, dodging no-crawl rules

2025-08-05--:--

Remote teams boost creativity and connection with personal “ramblings” channels in chat apps

2025-08-04--:--

AI Companions Risk Diluting the Crucial Pain of Loneliness

2025-08-03--:--

tmux alternative shpool challenges terminal multiplexers for a simpler, modern workflow

2025-08-02--:--

Big Builders Aren’t Blocking Housing Supply — It’s Zoning Rules Holding Us Back

2025-08-01--:--

Australia bans YouTube for under-16s to protect teens from addictive and low-quality content

2025-07-31--:--

ChatGPT Study Mode turns AI into your personal, interactive 24/7 tutor

2025-07-30--:--

Former US special forces officer breaks silence on Israeli forces shooting unarmed civilians at Gaza aid sites

2025-07-29--:--

EU’s age verification app locks out non-Google Androids, sparking digital sovereignty fears

2025-07-28--:--

Microsoft Copilot's Python sandbox rooted by path hijacking vulnerability in containerized environment

2025-07-27--:--

Steam and Itch.io face backlash over adult game bans driven by payment processors' censorship pressure

2025-07-26--:--

Intel’s bold reset: 15% layoffs, SMT returns, and a sharp AI pivot

2025-07-25--:--

When Privacy Is a Crime: Spanish Police Target Google Pixel Users Running GrapheneOS

2025-07-24--:--

Qwen3-Coder sets new agentic coding records with 480B params and 1M token context length

2025-07-23--:--

Anker recalls 1M+ power banks over fire hazards revealing hidden supply chain and quality control risks

2025-07-22--:--

00:00

1.0x

Perplexity AI exposed for stealthily scraping the web, dodging no-crawl rules

info@thepodcastcollective.com

#box-pro-ellipsis-176273105275685{-webkit-line-clamp:2;}Perplexity AI exposed for stealthily scraping the web, dodging no-crawl rules

Cloudflare exposes Perplexity AI’s stealth crawling tactics

“Objects should shut the fuck up” — critique of excessive device noise

Could interstellar object 3I/ATLAS be alien technology?

ChatGPT in university writing classes: a year-long experiment

Perplexity AI exposed for stealthily scraping the web, dodging no-crawl rules

info@thepodcastcollective.com

Perplexity AI exposed for stealthily scraping the web, dodging no-crawl rules