Sources & Bibliography
References, deep dives, and further reading for Vol 2: Stuff We Built on Top.
About Link Rot
Link rot is the phenomenon where hyperlinks break or disappear over time.
To mitigate this, the following steps have been taken:
- Every source includes an “Accessed” date indicating when the author last verified the link.
- Whenever technically possible, an “Archive” link (hosted by the Wayback Machine) is provided.
001 Topic
7 sources
- Stephen King Forum 2017 Accessed: 2026-05-10
Discussion about plot holes in Stephen King's work. The book itself is under copyright.
- Westeros.org 2017 Accessed: 2026-05-10
Similar to King's work, Martin's writing is still protected by copyright.
- Planet Publish Daniel Defoe 1719 Accessed: 2026-05-10
Chapter 4: information about removing clothes appears on page 76, pocket-stuffing is described on page 77.
- LibreOffice Accessed: 2026-05-10
One possible source for downloading the tar.gz package with LibreOffice source code.
- GitHub Linus Torvalds Accessed: 2026-05-10
Official repository for the Linux kernel.
Note: Linus is the primary maintainer and approver of changes but it would be a stretch to consider him the sole author.
- Reddit 2024 Accessed: 2026-05-10
Analysis of the ratio between actual code size and change history.
- Wikipedia Accessed: 2026-05-10
Information about the Mahabharata. Size calculations in MB are author's rough estimates. The article directly compares the Mahabharata to the Odyssey and Iliad.
5 Chapters
228 Topics
6 sources
- Smithsonian Magazine Maris Fessenden 2015 Accessed: 2026-05-10
Article about Andy George's sandwich experiment.
- Time Mark Rivett-Carnac 2015 Accessed: 2026-05-10
Coverage of Andy George's experiment.
- Vice Abby Moss 2015 Accessed: 2026-05-10
Interview with Andy George about the sandwich experiment.
- TrendForce 2026 Accessed: 2026-05-10
Report on mobile component market.
- Investopedia Adam Hayes 2025 Accessed: 2026-05-10
Article describing supply chains in semiconductor industry.
- Counterpoint Research 2026 Accessed: 2026-05-10
Quarterly report on global smartphone application processor market share.
7 sources
- arXiv Pietro Abate, Roberto Di Cosmo, Georgios Gousios, Stefano Zacchiroli 2020 Accessed: 2026-05-10
Review of research on dependency solvers, comparison matrix of package managers, and discussion of CUDF.
- arXiv Ryan Gibb, Patrick Ferris, David Allsopp, Thomas Gazagnaire, Anil Madhavapeddy 2026 Accessed: 2026-05-10
Recent paper (February 2026) formalizing dependency resolution semantics as Package Calculus, showing the problem is NP-complete.
- arXiv Ryan Gibb, Patrick Ferris, David Allsopp, Michael Winston Dales, Mark Elvers, Thomas Gazagnaire, Sadiq Jaffer, Thomas Leonard, Jon Ludlam, Anil Madhavapeddy 2025 Accessed: 2026-05-10
Continuation of the above work, more practical, explains differences across ecosystems.
- SemVer Tom Preston-Werner Accessed: 2026-05-10
The only canonical source for SemVer. Written by GitHub co-founder. Short, precise, with examples.
- npm Docs 2026 Accessed: 2026-05-10
How npm interprets SemVer in practice (tilde ~, caret ^).
- Andrew Nesbitt's Blog Andrew Nesbitt 2026 Accessed: 2026-05-10
Author runs ecosyste.ms and libraries.io. Catalogued resolver algorithms (SAT, PubGrub, ASP, backtracking) with concrete implementations per package manager. Best available synthesis for technical readers.
- Sourcegraph Blog Matt Rickard 2021 Accessed: 2026-05-10
Best popular technical text on the topic. Clear typology of dependency circles with technical examples. Written by a Google/Kubernetes engineer.
5 sources
- arXiv Mojtaba Shahin, Muhammad Ali Babar, Liming Zhu 2017 Accessed: 2026-05-10
- Martin Fowler's Bliki Martin Fowler 2013 Accessed: 2026-05-10
Short, precise definition from the person who brought these concepts to the mainstream. Co-creator of Continuous Delivery (Humble & Farley, 2010).
- Atlassian Sten Pittet Accessed: 2026-05-10
Most widely cited practical guide to distinguishing the three phases. Clear, with diagram. Used as a reference standard by most teams.
- GitLab Accessed: 2026-05-10
Good definition from one of the most popular CI/CD platforms. Clearly distinguishes continuous deployment (automatic) from continuous delivery (can be deployed).
- romenrg.com Romén Rodríguez-Gil 2017 Accessed: 2026-05-10
Technical blog with personal response from Martin Fowler in comments. Best practical text focused solely on distinguishing the three phases.
9 sources
- npm Blog 2016 Accessed: 2026-05-10
Official npm post-mortem. Contains timeline, decision about 'un-unpublishing,' and Laurie Voss quote: 'Un-un-publishing is an unprecedented action.' Ground truth for the entire incident.
- Azer Koçulu's Blog Azer Koçulu 2016 Accessed: 2026-05-10
Author's own statement. Indirectly quoted in most articles, important for understanding his motivation.
- The Register Chris Williams 2016 Accessed: 2026-05-10
Live coverage from the day of the incident. Contains left-pad source code, Laurie Voss quote, and data: 'Left-pad was fetched 2,486,696 times in just the last month.' Solid primary press source.
- InfoQ Abel Avram 2016 Accessed: 2026-05-10
Precise confirmation: Babel, Atom, Ember, React Native as affected projects. Good technical synthesis from the next day.
- LWN.net Nathan Willis 2016 Accessed: 2026-05-10
Most analytical of the hot-take texts. Confirms React.js, Babel, and Ember.js as affected frameworks. Important observation: 'no major interruptions appear to have hit live services' — builds failed but end users mostly noticed nothing.
- David Haney's Blog David Haney 2016 Accessed: 2026-05-10
Engineer writing on the day of the incident. Contains direct reference to React: 'Just ask the React team how well their week has been going.' Good as a quotable community voice.
- ScienceAlert Fiona Macdonald 2018 Accessed: 2026-05-10
Confirms Facebook, Netflix, Spotify via Babel and React. Quotes npm CEO Isaac Schlueter from Ars Technica.
- Wikipedia Accessed: 2026-05-10
Wikipedia article describing the incident. Contains source code — note that including empty lines the package had 17 lines, which some articles referenced.
- Biggo 2025 Accessed: 2026-05-10
Article published June 11, 2025, shedding different light on the event by revealing that the script to delete all packages was provided to the author by npm.
9 sources
- NVD/NIST 2021 Accessed: 2026-05-10
Official entry in US government's National Vulnerability Database. CVSS 10.0, technical description, list of affected versions (2.0-beta9 to 2.14.1), patch links. Canonical source for any CVE citation.
- LunaSec (GitHub) freeqaz 2021 Accessed: 2026-05-10
The original post-mortem — LunaSec coined the name 'Log4Shell' and was first to publish detailed technical analysis. Updated over several weeks as subsequent CVEs were discovered. Cited by CISA, Apache, and practically everyone else.
Note: At time of compilation, lunasec.io domain is inactive (DNS issue). Blog sources are available on GitHub. Linked file is source in MDX format, fully readable but note this is not the actual published page.
Fun fact: blog uses roughly the same tech stack as the bibliography you're reading now. Except this site's repo is not public.
- CISA 2021 Accessed: 2026-05-10
Official US Cybersecurity and Infrastructure Security Agency page with Emergency Directive 22-02, links to all CVEs, and guidance for federal agencies. Good as institutional source.
- Lawfare Nicholas Weaver 2021 Accessed: 2026-05-10
Solid analysis for non-specialists by Lawfare (Harvard Law / Brookings). Describes Minecraft as the starting point, critical infrastructure impacts, and SBOM implications. Good for broader context.
- LA Times Frank Bajak 2021 Accessed: 2026-05-10
Coverage from disclosure day, includes quote from expert Marcus Hutchins about the Minecraft exploit. Good as mainstream confirmation of scale.
- Nippon.com Nishiue Itsuki 2026 Accessed: 2026-05-10
Official Japanese source — Nippon.com is funded by Japan Foundation. Description of the technique, its applications, and spread beyond Japan.
- Flight Safety Australia 2018 Accessed: 2026-05-10
Article from Australian aviation authority CASA. Good context for application of the technique beyond railways, includes bibliography with original scientific sources (Railway Technical Research Institute 1994).
- Auth0 Blog Sebastian Peyrott 2019 Accessed: 2026-05-10
Technically most accurate non-academic description of history: Mocha → LiveScript → JavaScript, role of Netscape/Sun agreement, quotes Brendan Eich. Often linked as canonical source.
- Wikipedia Accessed: 2026-05-10
For the naming footnote, sufficient as additional confirmation — contains quote from Eich himself who called the name change 'a marketing ploy by Netscape.'
5 sources
- NVD/NIST 2025 Accessed: 2026-05-10
Canonical government entry. CVSS 10.0, technical description, list of affected versions.
- React.dev 2025 Accessed: 2026-05-10
Official post from React team (Meta). Contains disclosure timeline, list of affected packages and frameworks, patching instructions, and — notably — list of all follow-up CVEs updated through January 26, 2026. Best single source for the entire saga.
- React.dev 2025 Accessed: 2026-05-10
Official React team post listing all follow-ups: CVE-2025-55183, CVE-2025-55184, CVE-2025-67779 (December), CVE-2026-23864 (January). Also includes a quote that could be used in the book: 'This pattern shows up across the industry, not just in JavaScript. For example, after Log4Shell, additional CVEs were reported as the community probed the original fix.'
- AWS Security Blog CJ Moses 2025 Accessed: 2026-05-10
Primary source for Jackpot Panda and Earth Lamia. Report from MadPot honeypot infrastructure. One of the best intelligence documents on who exploited and how.
- Wiz Research Gili Tikochinski, Merav Bar, Danielle Aminov 2025 Accessed: 2026-05-10
Wiz researchers published the first PoC and had the earliest cloud exploitation telemetry. Contains data: 39% of cloud environments with vulnerable versions, confirmation of AWS credential harvesting.
5 sources
- Sansec 2024 Accessed: 2026-05-10
Original discoverers' report — Sansec was the first company to publish attack details. Contains timeline, sample malicious script code, list of affected domains, and updates from subsequent days (Namecheap domain suspension, Cloudflare response). Only first-order canonical source for the entire story.
- BleepingComputer Lawrence Abrams 2024 Accessed: 2026-05-10
First extensive press coverage from disclosure day. Contains Google's communication to advertisers, Andrew Betts quotes, and attack mechanism details. BleepingComputer was the first medium to cover Sansec's report — hence the DDoS on their infrastructure in response.
- The Hacker News Ravie Lakshmanan 2024 Accessed: 2026-05-10
Good article with broader perspective — describes scale after a week, Funnull's restart attempt under polyfill.com domain (also taken down by Namecheap), and long-term implications. Good as supplement to live coverage.
- Censys 2024 Accessed: 2026-05-10
Source for the 384,773 hosts number. Also includes scan of alternative domains taken over by the same operator, historical DNS records, and list of affected platforms (Hulu, Mercedes-Benz, WarnerBros). Only credible source for this specific number.
- SecurityWeek Eduard Kovacs 2026 Accessed: 2026-05-10
Most widely cited medium covering Hudson Rock findings. Contains summary of forensic evidence (credentials from LummaC2 infostealer, domain configuration conversations), operation goal (cryptocurrency laundering via Suncity Group gambling network), and attribution certainty caveats. Well-balanced between 'this is strong evidence' and 'this is one research firm.'
4 sources
- Medium Alex Birsan 2021 Accessed: 2026-05-10
Original and only canonical source — Birsan himself describes methodology, timeline, code examples, list of vulnerable companies, and technical details. Everything else is secondary to this text.
- Cybersecurity Dive Samantha Schwartz 2021 Accessed: 2026-05-10
Solid press coverage from publication day. Quotes Birsan directly, describes Microsoft's response (whitepaper, CVE-2021-24105), and remediation recommendations. Good institutional context.
- Sonatype Ax Sharma 2021 Accessed: 2026-05-10
Technical and published simultaneously with Birsan's report — Sonatype as a supply chain security company had ready analysis on publication day. Best text for readers wanting to understand the mechanism without reading the Medium post.
- AppleInsider Mike Peterson 2021 Accessed: 2026-05-10
Most detailed breakdown of payouts: Microsoft $40,000 (program record), Shopify $30,000, Apple $30,000, PayPal $30,000, total over $130,000. Also quotes Apple confirmation that RCE on servers would have been possible. Good as verifiable source for specific numbers.
235 Topics
9 sources
- OpenSSL 2014 Accessed: 2026-05-11
Official OpenSSL security bulletin issued on the day of disclosure. Short, technical, authoritative.
- NVD / NIST 2014 Accessed: 2026-05-11
Official NIST/NVD record for Heartbleed.
- heartbleed.com 2014 Accessed: 2026-05-11
Dedicated site describing the Heartbleed vulnerability — created at the time of disclosure.
- Cloudflare Blog Nick Sullivan 2014 Accessed: 2026-05-11
Cloudflare's Heartbleed Challenge confirmed that private keys could indeed be stolen via the vulnerability.
- Google Cloud Blog Christopher Glyer, Chris DiGiamo 2014 Accessed: 2026-05-11
Google Cloud post describing real-world exploitation of Heartbleed to bypass multi-factor authentication on VPNs.
- The Guardian Alex Hern 2014 Accessed: 2026-05-11
Contains the responsible developer's own explanation of how the bug was introduced.
- The Verge Russell Brandom 2014 Accessed: 2026-05-11
- Wired Robert McMillan 2014 Accessed: 2026-05-11
- The Spokesman-Review Daniel Gayle 2014 Accessed: 2026-05-11
6 sources
- daniel.haxx.se Daniel Stenberg 2022 Accessed: 2026-05-11
The legendary post in which Stenberg publishes an ultimatum email from a corporation demanding an audit of a Java library that curl does not use. He describes the level of ignorance and incompetence as breathtaking. Best single source illustrating how compliance processes replace actual thinking.
- daniel.haxx.se Daniel Stenberg 2021 Accessed: 2026-05-11
Stenberg publishes a death threat received because curl 'didn't work correctly' in a user's closed system. Illustrates the extreme end of free support expectations — users who treat maintainers' time as their entitlement.
- daniel.haxx.se Daniel Stenberg 2025 Accessed: 2026-05-11
Explains why Stenberg shut down the curl Bug Bounty program. He describes AI-generated security reports as a 'DDoS on human attention' — technically plausible-looking but factually wrong reports flooding maintainers at scale.
- LWN.net Jonathan Corbet 2025 Accessed: 2026-05-11
- curl.se Daniel Stenberg Accessed: 2026-05-11
List of companies using curl, maintained by the tool's author.
- wolfSSL 2019 Accessed: 2026-05-11
Stenberg's company offering paid commercial support for curl.
7 sources
- Reddit 2022 Accessed: 2026-05-11
Reddit thread containing a screenshot of the faker.js repository after Marak's deletion. Primary contemporaneous record since the original repo no longer exists on GitHub.
- Web Archive 2022 Accessed: 2026-05-11
Archived snapshot showing the entire repository history reduced to a single commit.
- npm Marak Squires 2022 Accessed: 2026-05-11
npm registry entry. Worth noting the version number at the time of the sabotage.
- The Register Thomas Claburn 2022 Accessed: 2026-05-11
- Revenera Marcus Lucero 2022 Accessed: 2026-05-11
- Heimdal Security Dora Tudor 2022 Accessed: 2026-05-11
- BleepingComputer Ax Sharma 2022 Accessed: 2026-05-11
7 sources
- oss-security (Openwall) Andres Freund 2024 Accessed: 2026-05-11
Original disclosure by Andres Freund. The primary source for the entire XZ backdoor story.
- boehs.org Evan Boehs 2024 Accessed: 2026-05-11
Detailed attack timeline analysis. Widely cited reconstruction of Jia Tan's multi-year social engineering campaign.
- TH Brandenburg (OPUS4) Conrad Ferneding 2024 Accessed: 2026-05-11
Academic bachelor's thesis with extensive technical breakdown and references. Detailed analysis of the backdoor mechanism and attack chain.
- Wired Dan Goodin 2024 Accessed: 2026-05-11
- Red Hat Blog 2024 Accessed: 2026-05-11
- Debian Security Announce Salvatore Bonaccorso 2024 Accessed: 2026-05-11
Debian maintainers noted how Jia Tan attempted to manipulate them into ignoring valgrind warnings as 'false positives'. Debian rolled back all XZ packages to the safe 5.4.5 version and revoked Jia Tan's infrastructure access. The incident showed that Debian's 'bureaucracy' functioned as a protective filter.
- Ubuntu Security 2024 Accessed: 2026-05-11
Canonical confirmed the malicious XZ version reached Noble Numbat repositories but was removed before the official LTS release thanks to Freund's discovery. Had it shipped, millions of corporate servers would have received a backdoor as part of a 'secure' Long-Term Support release.
244 Topics
4 sources
- Consumer Rights Wiki Accessed: 2026-05-12
- The Manual Matthew Denis 2023 Accessed: 2026-05-12
- The Sun (Malaysia) 2023 Accessed: 2026-05-12
- Motor1 Adrian Padeanu 2022 Accessed: 2026-05-12
10 sources
- Docker Blog Giri Sreenivas 2024 Accessed: 2026-05-12
Official announcement of the licensing change (August 31, 2021). Introduces the 250-employee / $10M revenue thresholds, grace period until January 31, 2022, and new subscription tiers.
- Sacra Accessed: 2026-05-12
Best available source for a private company. Reports ARR of $20M in 2021 (pre-license change), $165M in 2023, $207M in 2024. Secondary source — Docker Inc. is private and does not publish full financials.
- Docker Blog Scott Johnston 2022 Accessed: 2026-05-12
Official post from Docker Inc. after the first full subscription year. CEO Scott Johnston reports ARR above $50M — over four-fold year-on-year growth. Only primary source with figures directly from the company.
- Kubernetes Blog 2020 Accessed: 2026-05-12
Original document announcing dockershim deprecation in Kubernetes 1.20. Explains technical reasons (CRI incompatibility), timeline, and implications for users.
- Kubernetes Blog Sergey Kanzhelev, Jim Angel, Davanum Srinivas, Shannon Kularathna, Chris Short, Dawn Chen 2022 Accessed: 2026-05-12
Post accompanying dockershim removal in Kubernetes 1.24. Confirms the decision was technical and long-planned, and that the majority of clusters still used Docker at the time of the licensing announcement.
- InfoWorld Scott Carey 2021 Accessed: 2026-05-12
Day-of-announcement coverage. Business context, market reaction, quotes from CEO Scott Johnston.
- InfoQ Matt Campbell 2022 Accessed: 2026-05-12
Connects both threads — dockershim deprecation timeline and the post-license-change market situation. Also explains Mirantis's role as maintainer of cri-dockerd.
- USU Blog Dr. Christian Seeling 2022 Accessed: 2026-05-12
Software Asset Management perspective — practical compliance implications for IT and procurement departments. Useful for the 'compliance panic' context described in the case study.
- Open Container Initiative Accessed: 2026-05-12
OCI homepage describing the image-spec and runtime-spec standards. Includes the history of the initiative (founded 2015, emerging from Docker and CoreOS).
- GitHub / Open Container Initiative Accessed: 2026-05-12
Technical source for the claim about image portability across different container runtimes.
8 sources
- HashiCorp Blog Armon Dadgar 2023 Accessed: 2026-05-12
Primary canonical source — co-founder Dadgar's official post explaining the motivation (hyperscalers monetizing without contributing), scope of the change, and what remains under MPL. August 10, 2023.
- IBM Newsroom 2024 Accessed: 2026-05-12
Official IBM press release. $6.4 billion enterprise value, $35 per share. April 24, 2024.
- Spacelift Blog Flavius Dinu 2024 Accessed: 2026-05-12
Most comprehensive analysis of practical consequences — full timeline of the saga (August 10, 2023 through OpenTofu GA in January 2024), breakdown of who is affected and how, ecosystem perspective.
At the time link was verified, the article was updated (Feb 2026). Archive link points to the original version of the text.
- Gruntwork Blog Josh Padnick 2023 Accessed: 2026-05-12
Perspective from one of the largest Terraform ecosystem partners, directly affected by the change. Good example of enterprise reaction and 'compliance panic' described in the case study.
- Linux Foundation 2023 Accessed: 2026-05-12
Original announcement of the project's adoption by the Linux Foundation. Includes quotes from Jim Zemlin, list of 140+ supporting companies, and commitment of 18+ FTE developers for a minimum of 5 years. September 20, 2023.
- TechCrunch Ron Miller 2023 Accessed: 2026-05-12
Journalistic coverage with a quote from Jim Brikman (Gruntwork) explaining the name and project plans.
- HashiCorp Developer Accessed: 2026-05-12
Canonical description of the provider model: 'Terraform relies on plugins called providers to interact with cloud providers, SaaS providers, and other APIs.' Providers are distributed separately from Terraform Core with their own release cycles.
- HashiCorp Developer Accessed: 2026-05-12
Technical description of the plugin architecture — providers as separate binaries communicating with Terraform Core via RPC. Confirms providers can be written by any party. Companion to the Providers doc above.
7 sources
- Unity Blog Matt Bromberg 2024 Accessed: 2026-05-12
This URL originally contained the runtime fee announcement (September 12, 2023). Unity subsequently replaced the content with the cancellation notice — the original announcement is no longer available at this address.
- The Register Thomas Claburn 2023 Accessed: 2026-05-12
Best account of the reversal — documents how quickly Unity was forced to retreat, revised thresholds, removal of retroactivity, and general chaos. Includes stock price context ($39 → $32 in one week).
- SFGate Stephen Council 2023 Accessed: 2026-05-12
Documents Riccitiello's departure from Unity and detailed breakdown of his exit package (~$8.4M in stock options). Most credible source for a specific figure used in the book.
- X (Twitter) Mega Crit 2023 Accessed: 2026-05-12
Immediate official response from Slay the Spire developer (September 13, 2023): 'We have never made a public statement before. That is how badly you fucked up.' Includes announcement of migration to a new engine.
- Terraria Dev 'Unequivocally Condemns' Unity Fee Changes, Donates Over $200,000 to Other Game EnginesGamesRadar Dustin Bailey 2023 Accessed: 2026-05-12
Re-Logic (Terraria developer) declared it would never use Unity again and donated $200,000 to alternative engines including Godot. One of the most financially significant gestures of the protest.
- GameSpot Eddie Makuch 2015 Accessed: 2026-05-12
Retrospective quoting EA interim CEO Larry Probst after two consecutive 'Worst Company in America' titles (2012 and 2013). Confirms both titles and management's reaction.
- Wikipedia Accessed: 2026-05-12
Secondary reference for EA 'Worst Company in America' titles (2012, 2013). The original Consumerist voting results are archived but the site is defunct — Wikipedia's EA section footnotes point to the original Consumerist articles.
255 Topics
5 sources
- Understanding Evolution (UC Berkeley) Accessed: 2026-05-12
- The Collector Greg Pasciuto 2026 Accessed: 2026-05-12
- PubMed Central / NLM William G. Powderly 2019 Accessed: 2026-05-12
- American Phytopathological Society Amy Y. Rossman, Mary E. Palm 2002 Accessed: 2026-05-12
- Ephytia / INRAE Accessed: 2026-05-12
French agricultural research agency. Concise and direct: 'Phytophthora infestans which is not a fungus but a water mould also known as oomycete.'
22 sources
- GrokiPedia Accessed: 2026-05-12
- eylenburg.github.io Alphonse Eylenburg 2025 Accessed: 2026-05-12
Comprehensive visual timeline of browser engine history and consolidation.
- Electron.js (Official Docs) Accessed: 2026-05-12
- Electron.js (Official Docs) Accessed: 2026-05-12
- Wikipedia Accessed: 2026-05-12
- Slack Engineering Blog Anaïs Betts 2016 Accessed: 2026-05-12
Slack's own account of adopting Electron for its desktop application.
- Wikipedia Accessed: 2026-05-12
- How-To Geek Chris Hoffman 2019 Accessed: 2026-05-12
Accessible overview of CEF (Chromium Embedded Framework) prevalence in desktop applications.
- 3perf.com Ivan Akulov 2020 Accessed: 2026-05-12
Confirms Notion's use of Electron. Detailed performance analysis.
- Medium Stepan Zharychev 2020 Accessed: 2026-05-12
Analysis of Warcraft 3 Reforged's use of a Chromium-based renderer and resulting performance issues.
- Coherent Labs Accessed: 2026-05-12
Example of a AAA game using a Chromium-based UI rendering solution.
- MDN Web Docs Accessed: 2026-05-12
Official, technical, authoritative overview of browser architecture and rendering pipeline.
- BrowserStack 2023 Accessed: 2026-05-12
More accessible description of browser rendering engine architecture. Companion to the MDN article above.
- Stack Overflow 2011 Accessed: 2026-05-12
Community discussion with usable definitions distinguishing engines, frameworks, and libraries.
- GameFromScratch 2015 Accessed: 2026-05-12
Accessible explainer with concrete examples distinguishing library, framework, and engine concepts.
- Free Software Foundation Greg Farough 2023 Accessed: 2026-05-12
FSF's response to Google's Web Environment Integrity API proposal — one of several examples of Google's unilateral influence over web standards.
- The Register Thomas Claburn 2023 Accessed: 2026-05-12
Google ultimately abandoned the Web Environment Integrity proposal following widespread backlash.
- Phoronix Michael Larabel 2022 Accessed: 2026-05-12
Documents Google's unilateral decision to drop JPEG-XL from Chrome despite community support for the format.
- The Register Thomas Claburn 2026 Accessed: 2026-05-12
Google reversed its JPEG-XL decision. Illustrates the cycle of unilateral moves and reversals.
- Plausible Blog Marko Saric 2021 Accessed: 2026-05-12
Overview of the AMP saga — Google's proprietary web framework that received artificial search ranking boosts, and its eventual demise.
- Fast Company JR Raphael 2021 Accessed: 2026-05-12
History of Chrome OS from its first release to the tenth anniversary. Good quotes about the original vision.
Factual reference — dates, 2009 announcement, origin story.
15 sources
- VGInsights / Sensor Tower 2025 Accessed: 2026-05-12
Primary data source for game engine market share statistics.
- Creative Bloq Joe Foley 2025 Accessed: 2026-05-12
More accessible discussion of the VGInsights report findings.
- ILM 2020 Accessed: 2026-05-12
Official ILM press release on StageCraft technology. Cites all partners and describes the virtual production pipeline powered by Unreal Engine.
- IndieWire Bill Desowitz 2020 Accessed: 2026-05-12
Narrative account of the StageCraft technology. Good background companion to the ILM press release.
- Epic Games / Unreal Engine Accessed: 2026-05-12
Official Epic Games overview of Unreal Engine use cases across industries: automotive, film, architecture, broadcast.
- The Marketing Society Stephen Barnes 2021 Accessed: 2026-05-12
Covers BMW and other cross-industry Unreal Engine adoption examples.
- The Native Creation Rick Canfield 2024 Accessed: 2026-05-12
Film, aerospace, architecture, automotive — concrete examples of UE5 adoption beyond gaming.
- PC Gamer Wes Fenlon 2023 Accessed: 2026-05-12
Interview with Gabe Amatangelo. Best qualitative account of CD Projekt RED's decision to abandon REDengine for UE5.
- Super Jump Magazine Alex Antra 2022 Accessed: 2026-05-12
Broader industry context — other studios making the same switch, implications for the market.
- Reddit 2026 Accessed: 2026-05-12
Community discussion confirming that practitioners recognize and debate the 'Unreal look' phenomenon.
- YouTube / Corridor Crew Corridor Crew 2025 Accessed: 2026-05-12
Corridor Crew regularly discusses and critiques the 'Unreal Engine look' — identifying it as an aesthetic where virtual productions appear too sterile, plasticky, or 'cheap' despite high technical quality.
- YouTube / Criteon Criteon 2026 Accessed: 2026-05-12
Counterargument to the 'Unreal look' thesis. Useful for presenting a balanced view.
- Video Games Chronicle Chris Scullion 2026 Accessed: 2026-05-12
Director Gore Verbinski's public criticism of Unreal Engine aesthetics in film production. January 2026.
- PC Gamer Christopher Livingston 2026 Accessed: 2026-05-12
- YouTube / Joshua M Kerr Joshua M Kerr 2024 Accessed: 2026-05-12
13 sources
- Stack Overflow 2024 Accessed: 2026-05-12
Official survey results. PostgreSQL ranked #1 most-used database for the second consecutive year.
- EnterpriseDB Blog Marc Linster 2024 Accessed: 2026-05-12
Commentary on the survey results with historical context.
- PostgreSQL.org Accessed: 2026-05-12
Official description of the Core Team structure — 7 members, roles, and governance model.
- PostgreSQL Community Association Accessed: 2026-05-12
Official page of the non-profit legal entity behind the project.
- momjian.us Bruce Momjian 2020 Accessed: 2026-05-12
Documents the EDB/2ndQuadrant conflict and the unwritten 50% contributor rule that emerged from it.
- postgresql.fund Álvaro Hernández 2020 Accessed: 2026-05-12
In-depth discussion of PostgreSQL governance, the majority rule, and structural tensions within the project.
- EnterpriseDB Blog Bruce Momjian 2024 Accessed: 2026-05-12
Accessible explanation of the decentralized power model in the PostgreSQL project.
- InfoWorld Oleg Bartunov 2022 Accessed: 2026-05-12
History of JSON support from PostgreSQL 9.2 to 9.4, written from the developer perspective. Part of the JSON/JSONB (2012→2014) evolution story.
- PostgreSQL Wiki 2014 Accessed: 2026-05-12
Official documentation of JSONB introduction in PostgreSQL 9.4.
- EnterpriseDB Blog Petr Jelinek 2025 Accessed: 2026-05-12
History written by one of the primary contributors — from logical decoding in 9.4 to full logical replication in v10 (2014→2017).
- PostgreSQL.org 2017 Accessed: 2026-05-12
Official press release. Contains Core Team quote about 'years of work' on logical replication.
- Wikipedia Accessed: 2026-05-12
Reference for footnote on potato history — dates, initial European suspicion, spread across the continent.
- Springer Nature J. G. Hawkes, J. Francisco-Ortega 1993 Accessed: 2026-05-12
Academic source for the potato footnote.
3 sources
- TechCrunch Frederic Lardinois, Ingrid Lunden 2018 Accessed: 2026-05-12
- GitHub Blog Taylor Blau 2025 Accessed: 2026-05-12
Torvalds on Git's decentralization philosophy — ideal source for the irony of the world's decentralized version control system being centralized on a single Microsoft-owned platform.
- Gitnux Emilia Santos 2026 Accessed: 2026-05-12
Source for the 500M+ repositories figure and other GitHub scale statistics.
At the time of access, article was updated on May 11.
263 Topics
12 sources
- Karelia Software Blog Dan Wood 2015 Accessed: 2026-05-12
Primary source for the entire Sherlock/Watson story — Dan Wood's own account, including his paraphrase of the conversation with Jobs. Note: the site had an invalid SSL certificate at the time of checking; using a Web Archive version is recommended.
Note: At the time of access original article had an invalid SSL certificate. The Web Archive version is recommended for access.
- Wikipedia Accessed: 2026-05-12
Concise factual summary of the Watson/Sherlock case.
- How-To Geek Justin Pot 2017 Accessed: 2026-05-12
Good overview with the key quote and historical context explaining how the term entered tech vocabulary.
- TechCrunch Sarah Perez 2021 Accessed: 2026-05-12
Tile's reaction on the day of the AirTag announcement. Covers the U1/UWB access issue and unfair competition claims.
- AppleInsider Mike Peterson 2021 Accessed: 2026-05-12
Fuller context for Tile's antitrust allegations against Apple.
- Washington Post Reed Albergotti 2021 Accessed: 2026-05-12
- EUR-Lex 2022 Accessed: 2026-05-12
Full official text of the Digital Markets Act.
- Wikipedia Accessed: 2026-05-12
Footnote reference for AICOA — factual summary of the proposed US legislation.
- Project DisCo 2023 Accessed: 2026-05-12
Footnote reference. Analysis of why AICOA failed to pass.
- arXiv Dieter Coppens, Adnan Shahid, Sam Lemey, Ben Van Herbruggen, Chris Marshall, Eli De Poorter 2022 Accessed: 2026-05-12
Academic source confirming that not all UWB chips are open to third-party developers, limiting possible applications — the technical basis for the Tile/AirTag competitive asymmetry.
- Pocket-lint Elyse Betters Picaro 2022 Accessed: 2026-05-12
Accessible explanation of time-of-flight technology, Precision Finding, and the AirTag context.
- iFixit Craig Lloyd 2019 Accessed: 2026-05-12
Technical analysis of the U1 chip including UWB history and the proprietary nature of Apple's implementation.
10 sources
- Slate Will Oremus 2013 Accessed: 2026-05-12
Primary account of the rejected acquisition offer, based on the original Wall Street Journal report.
Concise factual reference for the $3B offer and other key Snap milestones.
- CNBC Salvador Rodriguez 2019 Accessed: 2026-05-12
Firsthand account of the acquisition process: Zuckerberg invited Systrom to his Menlo Park home, and both completed the deal documents over a weekend with a single lawyer present.
- TechCrunch 2012 Accessed: 2026-05-12
Original day-of-announcement coverage of the $1 billion acquisition.
- TechCrunch Josh Constine 2016 Accessed: 2026-05-12
Original launch coverage (August 2, 2016). Includes user numbers at the time: Instagram 500M total, Snapchat 150M DAU.
- TechCrunch Josh Constine 2017 Accessed: 2026-05-12
Key source for the 82% growth slowdown figure, based on Snap's IPO documents.
- Social Media Today 2017 Accessed: 2026-05-12
Source confirming Instagram surpassed Snapchat's daily active story users within eight months of launch.
- TechCrunch Josh Constine 2017 Accessed: 2026-05-12
One-year anniversary coverage with user comparison figures in the context of Snap's IPO.
- TechCrunch Ivan Mehta 2025 Accessed: 2026-05-12
- Snap Investor Relations 2025 Accessed: 2026-05-12
Official Snap press release for the Perplexity partnership.
13 sources
- Statista Felix Richter 2026 Accessed: 2026-05-12
Quarterly chart showing AWS + Azure + GCP + Alibaba accounting for approximately 70% of the cloud infrastructure market.
- The New Stack Joab Jackson, Lawrence E Hecht 2018 Accessed: 2026-05-12
Redis Labs: 'Cloud providers contribute very little (if anything) to those open source projects.' Core source for the Vampire Strategy mechanism.
- Tech Monitor 2018 Accessed: 2026-05-12
Redis Labs CMO: 'AWS are simply poaching open source investment.' The 'strip mining' framing.
- CMSWire Virginia Backaitis 2018 Accessed: 2026-05-12
Broader context covering Kafka, Redis, MongoDB and the managed services dynamic.
- The Register Tim Anderson 2021 Accessed: 2026-05-12
MongoDB CTO: DocumentDB is '34 per cent compatible' and a 'Frankenbase'. Primary source for the Amazon DocumentDB vs MongoDB case.
- MongoDB Accessed: 2026-05-12
MongoDB's official comparison — naturally one-sided but useful for specific technical incompatibility claims.
- Elastic Blog Steve Kearns 2021 Accessed: 2026-05-12
Official Elastic announcement of the license change away from Apache 2.0 (January 2021). Starting point of the Elasticsearch/OpenSearch saga.
- The New Stack Steven J. Vaughan-Nichols 2024 Accessed: 2026-05-12
History of the AWS OpenSearch fork and its eventual transfer to the Linux Foundation.
- Linux Foundation 2024 Accessed: 2026-05-12
Official announcement of the OpenSearch Software Foundation under the Linux Foundation (September 2024).
- Wikipedia Accessed: 2026-05-12
Concise history of the Elasticsearch fork.
- Wikipedia Accessed: 2026-05-12
History of Kubernetes' creation at Google and donation to CNCF. Wikipedia used as reference here given strong community maintenance of open source project articles, and verified against author's professional experience.
- Wikipedia Accessed: 2026-05-12
Reference for Go's origin at Google — part of the 'home field advantage' context.
- Yugabyte Blog Sid Choudhury 2018 Accessed: 2026-05-12
Optional background. Good context for the Databricks/Azure dynamic and open source monetization models generally.
4 Chapters
##1 Topic
2 sources
- Wikipedia Accessed: 2026-05-12
History of the phrase, first known print usage (1957), George Fuechsel, context from the early years of computing.
- Atlas Obscura Rob Stenson 2016 Accessed: 2026-05-12
Best narrative account of the phrase's history. Documents the earliest known printed occurrence in the Times Daily of Hammond, Indiana (November 10, 1957) and the Fuechsel story. Most thorough source found on the topic — the author searched press archives and appears to have corrected the existing historical record.
276 Topics
6 sources
- StatPearls / PubMed Central Aleksandar Popovic; Martin R. Huecker. 2023 Accessed: 2026-05-14
Distinguishes random error from systematic error (bias), defines selection bias and sampling bias in a research context.
- UNC Chapel Hill / ERIC Lorraine K. Alexander, Brettania Lopes, Kristen Ricchetti-Masterson, Karin B. Yeatt 2015 Accessed: 2026-05-14
Classic epidemiological definition of selection bias.
- Scribbr Kassiani Nikolopoulou 2022 Accessed: 2026-05-14
Definition, types, and examples of selection bias. More accessible than the UNC source.
- Eval Academy Sheldon Kallio 2022 Accessed: 2026-05-14
Practical discussion with concrete examples.
- IBM Tom Krantz , Alexandra Jonker 2026 Accessed: 2026-05-14
- Anodot Accessed: 2026-05-14
References Gartner estimates on the financial impact of poor data quality.
6 sources
- Qualtrics 2022 Accessed: 2026-05-14
Comprehensive overview of common survey design errors: leading questions, assumptive questions, double-barreled questions.
- Kantar Meghan Bazaman 2025 Accessed: 2026-05-14
Clear distinction: double-barreled = ambiguity, leading = bias.
- SurveyMonkey Abigail Matsumoto Accessed: 2026-05-14
Practical examples including loaded questions.
- Boise State University OER Seung Youn (Yonnie) Chyung 2025 Accessed: 2026-05-14
Academic discussion of primacy effect, left-side selection bias, and acquiescence bias in Likert scales.
- Wiley / Performance Improvement Seung Youn (Yonnie) Chyung, Megan Kennedy, Ingrid Campbell 2018 Accessed: 2026-05-14
Peer-reviewed study demonstrating that response option ordering significantly affects survey results.
- Simply Psychology Saul McLeod 2025 Accessed: 2026-05-14
Accessible overview of Likert scales, response order effects, and their impact on results.
11 sources
- Google / DoubleClick 2019 Accessed: 2026-05-14
Original Google report. Primary source for the 53% abandonment / 3-second threshold statistic. Available as PDF.
- Google Ad Manager Blog Alex Shellhammer, Juliette Neel 2016 Accessed: 2026-05-14
Google's own blog post promoting the report findings.
- Kissmetrics 2026 Accessed: 2026-05-14
Shows that bounce rate — the metric underpinning the 53% statistic — cannot distinguish satisfied from frustrated users. Google Analytics 4 introduced 'engaged sessions' precisely because the classic metric was misleading. Good counterpoint to cite alongside the Google report.
- Cambridge University Press / European Review Marilyn Strathern 1997 Accessed: 2026-05-14
The article in which the popular formulation of Goodhart's Law appears: 'When a measure becomes a target, it ceases to be a good measure.' Published in European Review, Vol. 5(3), pp. 305–321.
- Princeton University Press Jerry Z. Muller 2018 Accessed: 2026-05-14
The most widely read contemporary book on metric pathologies. Connects Goodhart's Law to case studies from education, medicine, the military, and business. ISBN: 9780691174952.
- Wikipedia Accessed: 2026-05-14
- Google web.dev Barry Pollard, Jeremy Wagner 2025 Accessed: 2026-05-14
Official Google documentation defining TTFB and its relationship to FCP and LCP. Authoritative and kept up to date.
- Google web.dev Accessed: 2026-05-14
Main documentation for Google's UX metric suite: LCP, INP, CLS — each measuring a different aspect of user experience.
- Shopify Engineering Blog Sia Karamalegos 2022 Accessed: 2026-05-14
Practical breakdown of the metric hierarchy (TTFB → FCP → LCP → INP) from an e-commerce engineering perspective. Illustrates precisely the ambiguity of 'loading time' discussed in this chapter.
- Analytics at Meta / Medium 2022 Accessed: 2026-05-14
Well-written practical guide on the pitfalls of proxy metrics in A/B testing, written by Meta engineers with strong methodological grounding. Ideal for the 'we measure what we can, not what we want' argument.
- Towards Data Science Eryk Lewinson 2020 Accessed: 2026-05-14
Accessible synthesis of Muller, Goodhart's Law, and Campbell's Law with examples. Useful bridge between technical and popular-science audiences.
6 sources
- Landon in a Landslide: The Poll That Changed PollingHistory Matters / George Mason University 1936 Accessed: 2026-05-14
At the time of composing this list the original article was not available (HTTP/404), but the content was archived by the Wayback Machine.
Contains the original Literary Digest text from October 31, 1936. Most accessible source with the verbatim forecast.
- Wikipedia Accessed: 2026-05-14
Election results, Gallup and Digest comparison, with references to primary sources. Acceptable as a supplementary reference.
- Oxford University Press / Public Opinion Quarterly Peverill Squire 1988 Accessed: 2026-05-14
The most important academic source for this case. The only empirical study of the failure's causes, based on 1937 Gallup data. Concludes that non-response bias — not sampling bias — was the dominant cause. Vol. 52, No. 1, Spring 1988, pp. 125–133. Also available via JSTOR (https://www.jstor.org/stable/2749114) and as PDF (https://criticalthinkingtext.wordpress.com/wp-content/uploads/2017/02/squire-literary-digest.pdf).
- Smithsonian Magazine Jackie Mansky 2016 Accessed: 2026-05-14
Good historical context for the Gallup breakthrough. Credible popular-science source.
- ProQuest Blog 2016 Accessed: 2026-05-14
References original press archives from the Boston Globe. Interesting detail: Gallup publicly predicted his own accuracy in advance.
- University of Pennsylvania / randomservices.org Accessed: 2026-05-14
Raw state-by-state Digest data. Useful for readers who want to see the scale of the error directly.
8 sources
- OKTrends (archived by gwern.net) Christian Rudder 2014 Accessed: 2026-05-14
Original blog post, no longer available at the original URL (blog.okcupid.com) — full text archived here. Only source containing the complete original text with all figures: 44%, 2,200 conversations, match counts.
- The New York Times Molly Wood 2014 Accessed: 2026-05-14
First major press account of the story. Frequently cited in academic literature.
- Newsweek Taylor Wofford 2014 Accessed: 2026-05-14
Direct interview with Rudder. Good quotes for context.
- PNAS Adam D. I. Kramer, Jamie E. Guillory, Jeffrey T. Hancock 2014 Accessed: 2026-05-14
The original Facebook study. Its publication in PNAS was itself part of the scandal.
- PNAS Inder M. Verma 2014 Accessed: 2026-05-14
Official PNAS editorial concern regarding the lack of informed consent from participants.
- Slate David Auerbach 2014 Accessed: 2026-05-14
Useful analysis covering both scandals simultaneously.
- Luvze Dylan Selterman 2014 Accessed: 2026-05-14
Brief, accessible ethical analysis by a psychologist. Compares both experiments and explains the IRB question.
- James Grimmelmann (personal site) James Grimmelmann, Leslie Meltzer Henry 2014 Accessed: 2026-05-14
Formal legal letter requesting access to IRB protocols under Maryland state law. For readers interested in the legal dimension.
6 sources
- Nature Jeremy Ginsberg, Matthew H. Mohebbi, Rajan S. Patel, Lynnette Brammer, Mark S. Smolinski, Larry Brilliant 2009 Accessed: 2026-05-14
Original GFT paper. Published in Nature (online November 2008, print February 2009). Primary source for the entire case study. Nature 457, 1012–1014.
- Nature Declan Butler 2013 Accessed: 2026-05-14
February 2013 Nature article that first publicized the scale of GFT's failure. Starting point of the public story. Nature 494, 155–156.
- Science David Lazer, Ryan Kennedy, Gary King, Alessandro Vespignani 2014 Accessed: 2026-05-14
The most important academic source for this case. Science article analyzing the causes of failure: algorithm dynamics, big data hubris, absence of CDC data as a corrective signal. Coined the terms 'parable of Google Flu' and 'big data hubris'. Science 343, 1203–1205.
- Harvard / SSRN David Lazer, Ryan Kennedy, Gary King, Alessandro Vespignani 2014 Accessed: 2026-05-14
Follow-up analysis showing the problems persisted after Google's modifications.
- Wired David Lazer, Ryan Kennedy 2015 Accessed: 2026-05-14
Accessible synthesis by the authors of the key Science paper. Good companion to the academic source.
- Time Bryan Walsh 2014 Accessed: 2026-05-14
Journalistic account with direct Lazer quotes. Includes the 'Dewey beats Truman' comparison.
289 Topics
5 sources
- BIPM / JCGM Accessed: 2026-05-14
Official reference vocabulary for metrology. Defines measurement error, systematic error, random error, and uncertainty.
The authoritative document for the science of measurement — unmatched as a reference.
JCGM (Joint Committee for Guides in Metrology) is the international body by BIPM (Bureau International des Poids et Mesures / International Bureau of Weights and Measures) responsible for this standard.
- Wikipedia Accessed: 2026-05-14
Key definition: random error = noise, systematic error = bias. Good overview from a metrology and statistics perspective.
- Scribbr Pritha Bhandari 2021 Accessed: 2026-05-14
Clear explanation with examples. Includes the key framing: random error is 'noise' that blurs the true signal of what's being measured.
- University of Maryland Physics R. H. B. Exell Accessed: 2026-05-14
Classic academic treatment from a physical sciences and laboratory experiment perspective.
- PubMed Central / Critical Care Pierre Squara, Thomas W L Scheeren, Hollmann D Aya, Jan Bakker, Maurizio Cecconi, Sharon Einav, Manu L N G Malbrain, Xavier Monnet, Daniel A Reuter, Iwan C C van der Horst, Bernd Saugel 2020 Accessed: 2026-05-14
Defines bias, systematic error, and random error with medical examples and clear diagrams distinguishing noise from bias.
The only source in this group written strictly for scientists rather than engineers.
The Digital Vacuum Cleaner: Scraping, Crawlers, and the Art of Collecting Everything Except What You Wanted
8 sources
The Digital Vacuum Cleaner: Scraping, Crawlers, and the Art of Collecting Everything Except What You Wanted
- ResearchGate Soumen Chakrabarti, Martin van den Berg, Byron Dom 2000 Accessed: 2026-05-14
Classic Stanford paper on crawler architecture. Foundational for the field and cited in hundreds of subsequent works.
- IEEE Xplore Gaurav Sharma 2024 Accessed: 2026-05-14
Contemporary academic survey covering definitions, methods, stages, and technologies.
- IETF M. Koster, G. Illyes, H. Zeller, L. Sassman 2022 Accessed: 2026-05-14
Official internet standard defining how crawlers communicate with site owners via robots.txt.
Also provides background for the concept of a crawler operating 'politely' — robots.txt is the mechanism of that politeness.
- Wikipedia Accessed: 2026-05-14
History, definitions, methods, and legal and technical considerations.
- Firecrawl Blog Bex Tuychiev 2026 Accessed: 2026-05-14
Clear engineering-perspective distinction between a crawler and a scraper.
- ZenRows Yuvraj Chandra 2025 Accessed: 2026-05-14
Compares scraping with API as an official data access channel. Explains the fragility of scrapers in the face of HTML changes.
- Wikipedia Accessed: 2026-05-14
Wikipedia article on the landmark case on the legality of scraping public data. Six years of litigation, ended in settlement. Background for the 'other side of a courtroom' framing.
- Search Engine Land George Nguyen 2021 Accessed: 2026-05-14
Documents the scale of the click farm problem and platform responses.
4 sources
- Nature David W. J. Thompson, John J. Kennedy, John M. Wallace, Phil D. Jones 2008 Accessed: 2026-05-14
Primary source for the entire case. Nature 453, 646–649.
Full text available as PDF via the author's site: https://www.atmos.colostate.edu/~davet/ao/ThompsonPapers/Thompson_etal_Nature2008.pdf
- Nature News Quirin Schiermeier 2008 Accessed: 2026-05-14
Short accessible summary of Thompson's findings for a general audience. Includes quotes from Phil Jones.
- RealClimate Gavin Schmidt 2008 Accessed: 2026-05-14
Discussion by climate scientists explaining both the Thompson paper and the broader context of bucket/engine-intake corrections going back decades. Written for an educated non-specialist reader.
- EarthArXiv Duo Chan, Peter Huybers 2020 Accessed: 2026-05-14
More recent analysis confirming and refining Thompson's results. Reduces the WW2 warm anomaly by 0.26°C using more advanced statistical methods.
12 sources
- Science Thomas R. Karl, Anthony Arguez, Boyin Huang, Jay H. Lawrimore, James R. McMahon, Matthew J. Menne, Thomas C. Peterson, Russell S. Vose, and Huai-Min Zhang 2015 Accessed: 2026-05-14
Primary source for the entire case. The paper that effectively removed the 'hiatus' from the scientific debate. Science 348(6242), 1469–1472.
- Science Zeke Hausfather, Kevin Cowtan, Peter Jacobs, Gavin Schmidt 2017 Accessed: 2026-05-14
Independent verification of Karl's results using buoy, satellite, and Argo data. Includes a clear explanation of the ship/buoy bias mechanism.
- UC Berkeley News Robert Sanders 2017 Accessed: 2026-05-14
Accessible summary of the Hausfather et al. paper. Includes brief interview with Zeke Hausfather on YouTube
- Carbon Brief Roz Pidcock 2015 Accessed: 2026-05-14
Solid day-of-publication journalism. Quotes multiple independent climate scientists.
- Skeptical Science Zeke Hausfather, Kevin Cowtan, David C. Clarke, Peter Jacobs, Mark Richardson, Robert Rohde 2017 Accessed: 2026-05-14
Technical explanation of the ship intake vs. buoy data difference with charts. Explains exactly where the 0.12°C offset comes from.
- Snopes Alex Kasprak 2017 Accessed: 2026-05-14
Documents the Lamar Smith subpoena and NOAA's refusal to comply. Includes quotes from Zeke Hausfather as verification author.
- NOAA 2024 Accessed: 2026-05-14
Official NOAA source on the Argo program: history, scope, and technical specs — 4,000 floats, 2,000m depth, 10-day cycle.
- Wikipedia Accessed: 2026-05-14
Footnote reference. 14-cylinder version rated at 80.08 MW (107,390 hp). First commercial deployment on Emma Mærsk (2006).
- Wärtsilä 2006 Accessed: 2026-05-14
Footnote reference. Official manufacturer press release confirming 80,080 kW output.
- ScienceDirect Sipeng Zhu, Kun Zhang, Kangyao Deng 2019 Accessed: 2026-05-14
Footnote reference. Peer-reviewed source confirming ~50% thermal efficiency, close to the Carnot limit — meaning roughly half the energy is lost as waste heat.
- ScienceDirect Nicola Terry, Ray Galvin 2023 Accessed: 2026-05-14
Footnote reference for household energy consumption context.
- British Gas Simon Wood 2026 Accessed: 2026-05-14
Footnote reference for household energy consumption benchmarks. Specific calculations are left to the reader.
2 sources
- RSC / Analytical Methods Madeline E. Clough, Eduardo Ochoa Rivera, Abbygail M. Ayala, Rebecca L. Parham, Joseph Pennacchio, Henry E. Thurber, Andrew P. Ault, Ambuj Tewari, Anne J. McNeil 2026 Accessed: 2026-05-14
Primary source for the entire case.
- Michigan News Morgan Sherburne 2026 Accessed: 2026-05-14
Accessible summary of the Clough et al. paper. Includes quotes from lead author Madeline Clough.
7 sources
- FERC / NERC 2021 Accessed: 2026-05-14
Federal final report. Confirms that 75.6% of outages resulted from frozen components and fuel supply problems. Describes the gas–power–gas feedback loop.
Note: At the time of writing I encountered 'Sorry, you have been blocked' message on the FERC site. I keep the original link but I was forced to use the Wayback Machine version.
- Update to April 6, 2021 Preliminary Report on Causes of Generator Outages and DeratesERCOT 2021 Accessed: 2026-05-14
Official ERCOT breakdown of generator outage causes by category.
Note: Ercot's security service is now blocking direct access to the report. It is still available via the Wayback Machine.
- University of Texas at Austin Energy Institute 2021 Accessed: 2026-05-14
Most detailed independent academic analysis. Includes full timeline, frequency drop data, and analysis of nameplate capacity vs. actual winter performance.
- BuzzFeed News Peter Aldhous, Zahra Hirji 2022 Accessed: 2026-05-14
CDC excess mortality analysis estimating 700+ deaths as the upper bound of official undercounting.
- Insurance Journal 2022 Accessed: 2026-05-14
Official Texas DSHS figure of 246 deaths with breakdown by cause, as cited by Insurance Journal (January 2022).
- U.S. Energy Information Administration Anodyne Lindstrom, Alex Gorski 2021 Accessed: 2026-05-14
Official EIA data: average price $22/MWh in 2020, $9,000/MWh cap held for 77 hours during the crisis.
- KUT / NPR Austin Matt Largey 2021 Accessed: 2026-05-14
Based on ERCOT board presentation. Quotes CEO Bill Magness. Explains the 59.4 Hz frequency collapse mechanism.
6 sources
- FCC / Public Intelligence 2012 Accessed: 2026-05-14
Primary source for everything: Engineer Doe, the Fifth Amendment invocation, seven engineers, the $25,000 fine for obstructing the investigation. Full partially-declassified version.
- Google Releases Full Report on Street View Investigation, Finds That Staff Knew About Wi-Fi SniffingTechCrunch Peter Ha 2012 Accessed: 2026-05-14
Day-of-publication coverage. Sets Google's PR blog version against the FCC findings.
- The Register Andrew Orlowski 2012 Accessed: 2026-05-14
Concise and pointed. The subheading 'Wheels fall off one rogue engineer claim' says it all.
- Slate Farhad Manjoo 2012 Accessed: 2026-05-14
Best narrative analysis of the FCC report. Identifies Engineer Doe as Marius Milner. Describes the senior manager email: 'Are you saying that these are URLs that you sniffed out of Wi-Fi packets?'
- NPR Eyder Peralta 2013 Accessed: 2026-05-14
Settlement with 38 state attorneys general. Quotes AG Schneiderman's statement.
- EPIC Accessed: 2026-05-14
Chronological compilation of all investigations: UK, Switzerland, Austria, Hungary, Canada, Hong Kong, and others.
8 sources
- simonweckert.com Simon Weckert 2020 Accessed: 2026-05-14
Primary source: project description with video and screenshots of specific streets (Schillingbrücke, Tucholskystraße with 'Google Berlin' label, Michaelbrücke).
- YouTube / Simon Weckert Simon Weckert 2020 Accessed: 2026-05-14
Recording of the performance showing streets turning red in real time.
- Android Authority Tristan Rayner 2020 Accessed: 2026-05-14
Interview with Weckert confirms authenticity, describes a year of planning, 99 Google accounts and SIM cards, the requirement for the cart to keep moving. Includes Google's reaction.
- Android Authority Adamya Sharma 2020 Accessed: 2026-05-14
Contains a key technical detail: the jam disappeared when the cart stopped or when a car passed at normal speed. Quotes a Google Maps engineer confirming the hack was possible.
- Artnet News Caroline Goldstein 2020 Accessed: 2026-05-14
Good narrative account. Includes Weckert's quote about 'rudimentary systems' and the May Day demonstration origin of the idea.
- Washington Post Britany Shammas 2020 Accessed: 2026-05-14
Short account including Google's response ('car or cart or camel').
- ResearchGate Michael Grieves 2016 Accessed: 2026-05-14
Footnote reference. Grieves' own account of the concept's origin from 2002, including the original three-component definition: real space, virtual space, and linking mechanism.
- IBM Nick Gallagher, Maggie Mae Armstrong 2020 Accessed: 2026-05-14
Footnote reference. Covers NASA/Apollo origins through Grieves (2002) to modern IoT applications. Explains the bidirectional data flow illustrated by the thermostat example.
7 sources
- michaeleisen.org Michael Eisen 2011 Accessed: 2026-05-14
The primary source for the entire case. Eisen discovered and documented the feedback loop — exact multipliers (1.270589 and 0.9983), timeline from April 8 to 18, and analysis of both bots' strategies.
- Amazon Seller Lists Book at $23,698,655.93 — Plus ShippingCNN John D. Sutter 2011 Accessed: 2026-05-14
Day-of coverage. Quotes Eisen and a pricing algorithm expert. Includes: 'It's like you put on the gas and didn't have the handbrake.'
Note: the original CNN article and new snapshots on Wayback Machine are not available at the time of compiling this. Instead I provide a link to snapshot from May 1, 2011.
- The Register Rik Myslewski 2011 Accessed: 2026-05-14
Concise technical account. Includes price details after the loop unwound.
- Fast Company Dionysios Demetis 2019 Accessed: 2026-05-14
Analysis that connects the case to the broader question of algorithmic reality construction. References a Journal of the Association for Information Systems publication.
- ACM Digital Library Le Chen, Alan Mislove, Christo Wilson 2016 Accessed: 2026-05-14
Foundational empirical paper documenting the scale of algorithmic pricing on Amazon (~500 bot-using sellers among the top 1,600 products). Identifies 'price jitter' patterns — chaotic price spikes caused directly by bots responding to each other. Academic foundation for the Fruit Fly case. WWW '16, pp. 1339–1349.
- Wharton / lmusolff.com Leon Musolff 2025 Accessed: 2026-05-14
Empirically documents how pricing algorithms enter mutual feedback loops producing outcomes far from market equilibrium. Provides the formal model for the mechanism described narratively in the text.
- Stanford Law School Renato Nazzini, James Henderson 2024 Accessed: 2026-05-14
Legal review documenting real cases of algorithmic price-fixing (including a poster-fixing conviction on Amazon). Frames bot-to-bot feedback loops as a systemic antitrust risk. Good context for 'consensus is not correctness'.
295 Topics
5 sources
- The Data Warehouse ETL ToolkitWiley Ralph Kimball, Joe Caserta 2004
ISBN: 978-0764567575.
Classic data warehousing reference. Kimball is one of the founding figures of the discipline. Defines ETL as an engineering practice.
- IBM Accessed: 2026-05-14
Accessible but technically grounded overview from a data warehousing pioneer. Covers ETL history from the 1970s through the cloud era.
- AWS Accessed: 2026-05-14
Practical overview with emphasis on relational databases and data pipelines. Explains the original rationale for ETL.
- SAS Accessed: 2026-05-14
Well-written non-academic overview. Traces ETL from its 1970s origins through data warehousing to the present day, without unnecessary jargon.
- Wikipedia Accessed: 2026-05-14
Solid definition with history, transformation typology, and data warehousing context. Well-referenced against technical literature.
6 sources
- Political Economy Research Institute, UMass Amherst Thomas Herndon, Michael Ash, Robert Pollin 2013 Accessed: 2026-05-14
Original critique paper. Contains the row number references (footnote 9, p. 7), the UK/New Zealand example with figures 2.4% / 19 years / -7.6%, and the corrected result of +2.2%. Working Paper No. 322.
- NBER Carmen Reinhart, Kenneth Rogoff 2010 Accessed: 2026-05-14
Original paper advancing the 90% debt threshold thesis and the -0.1% growth figure. Published in American Economic Review, Papers and Proceedings, 100(2), 573–578.
- Bloomberg Peter Coy 2013 Accessed: 2026-05-14
Concise FAQ from the moment of disclosure. Confirms the -0.1% → +2.2% correction and includes R&R's response.
- The Conversation Jonathan Borwein, David H. Bailey 2013 Accessed: 2026-05-14
Solid technical account. Lists the omitted countries (Australia, Austria, Belgium, Canada, Denmark) and quotes Paul Ryan.
- Wikipedia Accessed: 2026-05-14
Good compilation of political citations: Ryan budget, Olli Rehn, George Osborne. Includes the R&R response from the New York Times.
- Committee for a Responsible Federal Budget 2013 Accessed: 2026-05-14
Documents R&R's own correction (+0.2% rather than +2.2%) and explains the methodological difference between the two figures.
6 sources
- Genome Biology Mark Ziemann, Yotam Eren, Assam El-Osta 2016 Accessed: 2026-05-14
Original study establishing the 20% figure. Scanned 3,597 papers from 18 leading genomics journals (2005–2015). Confirms SEPT2 → date and MARCH1 → date conversions. Genome Biology 17, 177.
- PLOS Computational Biology / PubMed Central Mandhri Abeysooriya, Megan Soria, Mary Sravya Kasu, Mark Ziemann 2021 Accessed: 2026-05-14
Follow-up scan of 11,117 papers with gene lists (2014–2020). Result: 30.9% contained errors — the problem grew after 2016 rather than shrinking.
- The Conversation Mark Ziemann, Mandhri Abeysooriya 2021 Accessed: 2026-05-14
Written by the original study authors. Illustrates what happens when ETL is replaced by Excel — a natural bridge from the previous case study.
- The Verge James Vincent 2020 Accessed: 2026-05-14
Quotes HGNC coordinator Elspeth Bruford. Confirms 27 genes renamed, including MARCHF1, SEPTIN1, SEPTIN2. Explains why Microsoft won't change the default settings.
- Vice Gavin Butler 2020 Accessed: 2026-05-14
Good coverage with HGNC quotes and historical context for gene naming changes.
- Retraction Watch Mandhri Abeysooriya, Mark Ziemann 2023 Accessed: 2026-05-14
Current overview of the problem by the original study authors. Notes ongoing errors in Nature Communications, PLOS ONE, and Scientific Reports.
5 sources
- The Risks Digest Clive Feather 1996 Accessed: 2026-05-14
Original 1996 report of the incident. Cited by Wikipedia and Encyclopedia MDPI as a primary source. Risks Digest 18(7), 25 April 1996.
- Scholarly Community Encyclopedia (MDPI) 2022 Accessed: 2026-05-14
Solid referenced overview. Includes a chronology of incidents from 1996 to 2021. Cites the Risks Digest as primary source.
- Tedium Ernie Smith 2016 Accessed: 2026-05-14
Thorough long-form account of the problem's history. Includes the AOL spokesperson quote to the Scunthorpe Evening Telegraph, plus examples from Google SafeSearch and iOS.
- Heritage Gateway Accessed: 2026-05-14
Official archaeological heritage record. Documents Anglo-Saxon settlement traces on the site of present-day Scunthorpe.
- Wikipedia Accessed: 2026-05-14
Reference for the first recorded appearance of the town's name.
4 sources
- Wired Christopher Null 2015 Accessed: 2026-05-14
Well-documented first-person account. Includes specific details about Bank of America, mortgage forms, and the email address null@nullmedia.com.
- Gizmodo A;ossa Walker 2016 Accessed: 2026-05-14
Based on the BBC article. Confirms issues with airline tickets and the IRS. Includes Jennifer's quote: 'I've been asked why I'm calling and when I try to explain the situation, I've been told there's no way that's true.'
- Born SQL 2017 Accessed: 2026-05-14
Solid technical treatment of the problem from a SQL perspective: validation, escaping, form handling. References Christopher Null's article and adds engineering analysis. Good bridge between narrative and mechanics.
- Streamline Verify Frank Strafford 2018 Accessed: 2026-05-14
Documents the case of Angela Johnson Null in the GSA-SAM system. Shows the problem affects multiple people beyond Jennifer and Christopher.
4 Chapters
##2 Topics
8 sources
- MacTutor History of Mathematics, University of St Andrews J. J. O'Connor, E. F. Robertson 1999 Accessed: 2026-05-14
Authoritative academic source. Covers uncertainty around dates and birthplace, the House of Wisdom connection, and the etymology of 'algebra' and 'algorithm'.
- MacTutor History of Mathematics, University of St Andrews J. J. O'Connor, E. F. Robertson 2001 Accessed: 2026-05-14
Academic account of the journey of Hindu-Arabic numerals from India through the Arab world to Europe. Covers al-Khwarizmi's role in popularising the positional system and zero as a placeholder. Cites original Arabic sources and Latin translations.
- Episodes in the Mathematics of Medieval IslamSpringer J. L. Berggren 2003
ISBN: 978-0-387-40605-3
Standard academic reference on Islamic mathematics. Covers al-Khwarizmi's contributions in algebra and the decimal system. Cited by MacTutor and other academic sources as a reference text.
- Britannica Accessed: 2026-05-14
Standard encyclopaedic overview. Cites the Latin translation 'Algoritmi de numero Indorum' as the source of the word 'algorithm'.
- NASA Science 2017 Accessed: 2026-05-14
Accessible but reliable overview. Traces the path from the Arabic name through the Latin 'Algoritmi' to the English 'algorithm'.
- University of Melbourne / Find an Expert Debbie Passey 2024 Accessed: 2026-05-14
Good popular-science piece with solid academic grounding. Also covers the role of Hindu-Arabic numerals and zero in the context of modern computing.
- Introduction to Algorithms (4th ed.)MIT Press Cormen, Leiserson, Rivest, Stein 2022
ISBN: 978-0-262-03968-5
Universally known as 'CLRS' — the standard algorithms textbook used in university courses worldwide. Chapters 6–8 cover sorting algorithms (heapsort, quicksort, mergesort); chapters 8–9 cover linear-time sorting. Includes formal complexity analysis for each algorithm.
- MIT OpenCourseWare 2020 Accessed: 2026-05-14
Official MIT course syllabus. Lists sorting as a central topic and recommends CLRS as the reference text. Free access to lecture materials.
4 sources
- NIST Computer Security Resource Center Accessed: 2026-05-14
Official NIST definition: 'An algorithm that, given the same inputs, always produces the same outputs.'
- ScienceDirect Topics Accessed: 2026-05-14
Academic overview with references to scientific literature. Covers determinism in serial and parallel computing contexts, including race conditions.
- GeeksforGeeks Accessed: 2026-05-14
Standard industry overview with examples (Merge Sort as deterministic, Randomized QuickSort as non-deterministic) and illustrative code.
- Wikipedia Accessed: 2026-05-14
Accessible overview with a good list of non-determinism sources (user input, hardware timer, race conditions) — matching what the text describes as primary sources of non-determinism in IT.
303 Topics
9 sources
- CNBC Jeff Cox 2012 Accessed: 2026-05-14
Cites the official BATS statement on the cause: 'single match engine [...] encountered a software bug related to IPO auctions, which rendered open customer orders in this symbol range inaccessible.' Confirms the A–BF ticker range and three erroneous AAPL transactions.
- Los Angeles Times 2012 Accessed: 2026-05-14
Day-of account. Quotes CEO Joe Ratterman. Confirms the $16 offer price, IPO withdrawal, and SEC contact with BATS.
- Fordham Journal of Corporate and Financial Law Thomas Michael 2012 Accessed: 2026-05-14
Legal analysis of the event. Confirms the sequence of events and consequences for underwriters (~$7.1M in lost fees).
- The Wall Street Journal Tom Lauricella, Scott Patterson, David Benoit 2012 Accessed: 2026-05-14
Confirms impact on other companies
Note: article is behind a paywall, but even the headline and summary confirm the key details of the event.
- Institutional Investor Loch Adamson 2012 Accessed: 2026-05-14
Good description of the failure mechanism: one of 32 matching engines, the 10:45 timing, a bug in the messaging system preventing price updates.
- Cboe Holdings / PR Newswire 2016 Accessed: 2026-05-14
Official acquisition announcement. $3.2B at announcement (September 2016), transaction closed February 2017 at $3.4B.
- Wikipedia Accessed: 2026-05-14
Footnote reference. Confirms Paul Maritz's 1988 Microsoft email 'Eating our own Dogfood' to Brian Valentine as the origin of the term.
- Linux Foundation 2015 Accessed: 2026-05-14
Footnote reference. Torvalds in his own words: Git became self-hosting one day after its announcement (April 7, 2005).
Footnote reference. Confirms the timeline: development started April 3, 2005; self-hosting achieved April 7 — four days later.
7 sources
- British Educational Research Journal Adrian Kelly 2021 Accessed: 2026-05-14
Solid academic treatment. Analyses the algorithm mechanism, the small-cohort problem, and compares the situations in England and Ireland. Cites official Ofqual documents. Vol. 47(3).
- University of Bristol / Centre for Multilevel Modelling George Leckie, Lucy Prior Accessed: 2026-05-14
Academic data analysis. Confirms that 40% of Centre-Assessed Grades were downgraded by one or more grades.
- Wikipedia Accessed: 2026-05-14
Complete chronology: results announced August 13, 'no U-turn' August 15, decision reversed August 17, Ofqual chief executive resigned August 25.
- BERA (British Educational Research Association) Anthony Kelly 2021 Accessed: 2026-05-14
Solid overview of the mechanism and political context. Quotes Ofqual's deputy chief regulator on the disproportionate impact on students from disadvantaged backgrounds.
- Container Solutions Blog Charles Humble 2021 Accessed: 2026-05-14
Good technical description of the mechanism: the 15-student threshold, three-year school history, ignoring teacher assessments. Quotes Curtis Parfitt-Ford.
- LSE Impact Blog Daan Kolkman 2020 Accessed: 2026-05-14
Analysis from an algorithmic accountability perspective. Confirms the protest slogan and data on grade increases at private vs. state schools (+4.7 percentage points).
- CNBC Sam Shead 2020 Accessed: 2026-05-14
Day-of-crisis coverage. Confirms ~39% of grades were downgraded and the government U-turn.
9 sources
- CNN Business Nick Thompson 2012 Accessed: 2026-05-14
Cites the official Mildura police statement. Confirms the 70 km displacement, 46°C heat, and stranded motorists without food or water for up to 24 hours.
- Cult of Mac Luke Dormehl 2025 Accessed: 2026-05-14
Reproduces the full police statement text. Confirms all key figures.
- Slate Fruzsina Eördögh 2012 Accessed: 2026-05-14
Adds the data provenance context: the coordinates came from the Australian government gazetteer, which tagged Murray-Sunset National Park as 'Mildura Rural City'.
- International Business Times Dave Smith 2013 Accessed: 2026-05-14
Documents the Fairbanks, Alaska incident.
- The Guardian Charles Arthur 2012 Accessed: 2026-05-14
Confirms the September 28, 2012 apology and the recommendation to use Google Maps via the browser.
Note: Apple's official apology page has since been removed from apple.com — which, given the circumstances, feels entirely on brand. The page is also blocked from Web Archive, so no cached copy is available. Screenshots circulate online but cannot be verified for authenticity, so the linked article will have to serve as the primary reference.
- Wikipedia Accessed: 2026-05-14
Confirms acquisitions (Placebase 2009, Poly9 2010, C3 Technologies 2011), Cook's apology of September 28, 2012, the recommendation to use Google Maps via the browser, and the departures of Forstall and Williamson.
- Wikipedia Accessed: 2026-05-14
Confirms the dismissal mechanism: Forstall refused to sign the apology as the 'directly responsible individual' for Maps.
- Steve JobsSimon & Schuster Walter Isaacson 2011
ISBN: 1-4516-4853-7
Source for the '$40 billion' quote and 'thermonuclear war' against Android. Foundation of the narrative about Apple's motivation to break its dependency on Google.
- Knowledge at Wharton Eric Clemons, Peter Fader 2012 Accessed: 2026-05-14
Covers the strategic background of Apple's decision. Quotes experts on the cost of the political choice to remove Google Maps.
315 Topics
8 sources
- Business Dynamics: Systems Thinking and Modeling for a Complex WorldMcGraw-Hill John D. Sterman 2000
ISBN: 978-0072389159.
Standard academic textbook on system dynamics. Formally defines feedback loops, covers reinforcing and balancing loops with mathematical examples.
- Thinking in Systems: A PrimerChelsea Green Publishing Donella H. Meadows 2008
ISBN: 978-1603580557.
Widely read popular-science introduction to systems thinking. Defines feedback loops in the first chapter through everyday analogies (thermostat, population, economy). Written for non-specialists.
- MIT OpenCourseWare Erik Demaine, Jason Ku, Justin Solomon 2020 Accessed: 2026-05-14
Formal treatment of recursion as a foundation of algorithmics, contrasted with iterative (linear) approaches.
Accessible explanation of recursion as 'a function that calls itself.' Everyday analogies, no mathematical jargon.
- Sapiens: A Brief History of HumankindHarper Yuval Noah Harari 2015
ISBN: 978-0062316110.
Source for the Level One / Level Two chaos distinction. Examples: weather (Level One), stock markets and revolutions (Level Two).
Harari holds a D.Phil. in history from Oxford (2002), specialising in medieval military history. Sapiens is a popular-science work outside his academic specialisation — which the text acknowledges.
- The Hebrew University of Jerusalem Accessed: 2026-05-14
Reference for Harari's academic credentials from his university profile page.
Solid philosophical and mathematical treatment of chaos theory. Defines sensitive dependence on initial conditions, nonlinearity, and aperiodicity as the three characteristics of mathematical chaos. Explains why chaos ≠ randomness.
- American Physical Society 2003 Accessed: 2026-05-14
History of Lorenz's discovery. Explains the butterfly effect as sensitive dependence on initial conditions, with historical and mathematical context.
12 sources
- ProPublica Heather Vogell, Haru Coryne, Ryan Little 2022 Accessed: 2026-05-14
Main investigative piece (October 15, 2022). Describes the YieldStar mechanism, Seattle data, Camden Property Trust, and quotes from former RealPage employees.
- ProPublica Heather Vogell 2024 Accessed: 2026-05-14
Day-of-filing account (August 23, 2024). Quotes AG Kantér and the substance of the charges.
- National Association of Attorneys General 2024 Accessed: 2026-05-14
Case page with docket number, list of defendants, and participating states.
- Wilson Sonsini Jeffrey C. Bank, Brian Smith, Jacob Lozano, Angela L. Brown, Noora Bayrami 2025 Accessed: 2026-05-14
Detailed legal analysis of the November 2025 DOJ settlement and consent decree terms.
- White House Council of Economic Advisers 2024 Accessed: 2026-05-14
Original CEA report (December 17, 2024). Source for the $3.8B figure and $70/month per unit. Methodology based on ACS, Zillow, and RealPage data.
- ProPublica Heather Vogell 2025 Accessed: 2026-05-14
Non-monetary DOJ settlement with RealPage (November 26, 2025). Context for parallel settlements with landlords.
- ProPublica Rent Barons 2025 Accessed: 2026-05-14
Greystar's non-monetary settlement with DOJ (August 2025).
- GRC Report 2025 Accessed: 2026-05-14
$50M Greystar settlement in the tenant class action (October 2025). $141M total from 26 landlords.
- California Attorney General 2025 Accessed: 2026-05-14
Official press release (November 19, 2025). $7M state settlement involving nine attorneys general.
- Multifamily Executive Joe Bousquin 2009 Accessed: 2026-05-14
Quotes Camden CEO Campa on the paradox of rising income alongside higher tenant turnover. Source for the $10M net income figure.
- Axios Emily Peck 2024 Accessed: 2026-05-14
Accessible popularisation of the CEA report with original graphics and RealPage's response.
- Propmodo Franco Faraudo` 2026 Accessed: 2026-05-14
Industry analysis of what the DOJ settlement changes and what it does not. PropTech sector perspective.
10 sources
- SEC / CFTC 2010 Accessed: 2026-05-14
Primary regulatory report (104 pages, October 2010). Source for all key figures: Waddell & Reed, $4.1B, hot potato trading, the 5-second pause, and cancelled transactions.
- CFTC Andrei Kirilenko, Albert S. Kyle, Mehrdad Samadi, Tugkan Tuzun 2014 Accessed: 2026-05-14
CFTC analytical report (March 2014). Key data on HFT and the hot potato mechanism.
- SEC / CFTC 2010 Accessed: 2026-05-14
Preliminary report issued days after the crash.
- Washington Post Zachary A. Goldfarb 2010 Accessed: 2026-05-14
Day-of-publication coverage of the SEC/CFTC report release. Quotes from regulators.
- SIFMA Katie Kolchin 2020 Accessed: 2026-05-14
Comprehensive industry retrospective with full chronology and dates of regulatory mechanisms introduced after the crash.
- NPR Planet Money Jacob Goldstein 2010 Accessed: 2026-05-14
Accessible explanation of the mechanism for a general audience. Good narrative context.
- Journal of Finance / University of Cambridge Repository Andrei Kirilenko, Albert S. Kyle, Mehrdad Samadi, Tugkan Tuzun 2017 Accessed: 2026-05-14
Peer-reviewed paper using CFTC data. Analysis of HFT's role in the crash.
- Hastings Business Law Journal Ian Poirier 2012 Accessed: 2026-05-14
Legal analysis. Covers stub quotes and regulatory mechanisms introduced after the crash.
- IT Brew Billy Hurley 2025 Accessed: 2026-05-14
Current perspective connecting lessons from 2010 to AI automation risk. Good essayistic supplement.
- NY Fed Liberty Street Economics Adam Biesenbach, Marco Cipriani 2012 Accessed: 2026-05-14
Federal Reserve Bank of New York analysis. Independent regulatory perspective from outside the SEC/CFTC.
11 sources
- The Markup Aaron Sankin, Dhruv Mehrotra for Gizmodo, Surya Mattu, Annie Gilbertson 2021 Accessed: 2026-05-14
First independent analysis of 5.9 million real PredPol predictions. Key evidence for systematic targeting of Latino and Black neighbourhoods.
- The Markup Dhruv Mehrotra, Surya Mattu, Annie Gilbertson, Aaron Sankin 2021 Accessed: 2026-05-14
Methodology behind the investigation. Describes how Gizmodo found 7.8 million predictions on an unsecured AWS server.
- Motherboard / Vice Caroline Haskins 2019 Accessed: 2026-05-14
Interview with researchers. Key quote: PredPol is in practice a 'moving average' of previous arrest locations.
- Significance (RSS) Kristian Lum, William Isaac 2016 Accessed: 2026-05-14
Key simulation study. Running PredPol on Oakland data reveals a self-reinforcing loop — the algorithm repeatedly sends police to the same locations. Vol. 13.
- arXiv Danielle Ensign, Sorelle A. Friedler, Scott Neville, Carlos Scheidegger, Suresh Venkatasubramanian† 2017 Accessed: 2026-05-14
Formal mathematical analysis of the feedback loop in PredPol. Demonstrates the divergence mechanism between predictions and actual crime.
- NYU Law Review Online Rashida Richardson, Jason M. Schultz, Kate Crawford 2019 Accessed: 2026-05-14
Analysis of corrupted input data (arrest records) as a source of systemic errors. 94 N.Y.U. L. Rev. Online 15.
- Brennan Center for Justice Tim Lau 2020 Accessed: 2026-05-14
Overview of predictive policing systems and controversies. Covers LAPD and Chicago implementations.
- FBI Law Enforcement Bulletin Zach Friend 2021 Accessed: 2026-05-14
Law enforcement perspective on the mechanism and deployment. Primary source for the '500-square-foot locations' description in Santa Cruz.
- MIT Technology Review Tate Ryan-Mosley, Jennifer Strong 2020 Accessed: 2026-05-14
Interview with Hamid Khan (Stop LAPD Spying Coalition). Activist perspective vs. the COVID budget-cut narrative.
- The Next Web Thomas Macaulay 2020 Accessed: 2026-05-14
Coverage of the programme termination announcement (April 21, 2020). Quotes Police Chief Moore on the budget rationale.
- MIT Technology Review Will Douglas Heaven 2020 Accessed: 2026-05-14
Broad context for the debate on algorithmic racism in policing following the events of 2020.
10 sources
- The Boston Globe Liz Kowalczyk 2010 Accessed: 2026-05-14
Original article describing the January 2010 death of an 89-year-old at MGH. Ten nurses did not hear alarms for 20 minutes. Cited in all subsequent regulatory reports.
- The Boston Globe Liz Kowalczyk 2011 Accessed: 2026-05-14
Investigation identifying 216 deaths in the US between 2005 and 2010 linked to patient monitor alarm problems. Broad documentation of the scale of the issue.
- The Boston Globe Liz Kowalczyk 2011 Accessed: 2026-05-14
Detailed descriptions of 11 deaths, including UMass 2010 (alarms ignored for nearly an hour) and UMass 2007 (75 minutes without response).
- The Joint Commission 2013 Accessed: 2026-05-14
Official regulatory alert. 80 deaths and 13 cases of permanent harm in the TJC database between 2009 and 2012. 85–99% of alarms required no intervention.
- ECRI Institute 2026 Accessed: 2026-05-14
ECRI listed alarm hazards as the number one health technology risk for four consecutive years (2011–2015). Annual report series; link leads to current edition.
- AHRQ / PSNet Michele M. Pelter, Barbara J. Drew 2015 Accessed: 2026-05-14
Synthetic review of cases and mechanisms. Accessible introduction for non-specialist readers.
- National Library of Medicine Kelly Creighton Graham, Maria Cvach 2010 Accessed: 2026-05-14
First published study (2010) to define and document the scale of the phenomenon. Cited in all subsequent reports. doi:10.4037/ajcc2010651. Vol. 19(1), pp. 28–35.
- Biomedical Instrumentation & Technology Maria Cvach 2012 Accessed: 2026-05-14
Integrative review. Data on 350–700 alarms per bed per day and 566 deaths in the FDA database between 2005 and 2010. Vol. 46(4), pp. 268–77.
- PLOS One Barbara J. Drew, Patricia Harris, Jessica K. Zègre-Hemsey, Tina Mammone, Daniel Schindler, Rebeca Salas-Boni, Yong Bai, Adelita Tinoco, Quan Ding, Xiao Hu 2014 Accessed: 2026-05-14
UCSF study on 461 ICU patients. 2.5 million unique alarms over 31 days; 187 audible alarms per bed per day.
- The Boston Globe Liz Kowalczyk 2013 Accessed: 2026-05-14
Example of a successful intervention: BMC reduced alarms from 88,000 to 10,000 per week on a single ward. A useful counterpoint to the narrative of inevitability.
326 Topics
10 sources
- MathWorks (MATLAB Documentation) Accessed: 2026-05-14
Concise technical definition: a local minimum is a point better than its neighbours; a global minimum is better than all possible points. Introduces the concept of 'basins of attraction'.
- Machine Learning Mastery Jason Brownlee 2021 Accessed: 2026-05-14
Accessible article from an ML perspective. Good examples of when local search is sufficient and when global search is needed.
- Baeldung on Computer Science Panagiotis Antoniadis 2024 Accessed: 2026-05-14
Formal definitions with visual examples and discussion of algorithms (gradient descent, simulated annealing, etc.).
- FICO Xpress Optimization Accessed: 2026-05-14
Technical documentation. Good analogy: 'finding a valley in a range of mountains vs finding the deepest valley'.
- GeeksforGeeks Accessed: 2026-05-14
Accessible introduction with formal mathematical definitions. Suitable for technical readers.
- NOAA / National Ocean Service 2024 Accessed: 2026-05-14
Official US government source. Explains the three different definitions of 'highest mountain' (above sea level, from Earth's centre, from base to peak).
- USGS Accessed: 2026-05-14
Official source: total height of Mauna Kea is nearly 33,500 feet (10,211 m), with its base ~6,000 m below sea level.
- Wikipedia Accessed: 2026-05-14
Precise data: Chimborazo is 2,168 m farther from Earth's centre than Everest.
- Wikipedia Accessed: 2026-05-14
MOLA data: 21.287 km height, approximately 2.5× Everest. Approximately tied with Rheasilvia on Vesta as the tallest mountain in the Solar System.
- Wikipedia Accessed: 2026-05-14
2,228 m above sea level. Highest peak of continental Australia.
10 sources
- Etsy Seller Handbook 2026 Accessed: 2026-05-14
Etsy officially acknowledges the problem: 'In the past, Etsy's search placed heavy emphasis on the title, pushing some sellers to pack in keywords. This often leads to longer, harder-to-read titles.' Key admission from the platform itself.
- Etsy Community Announcements 2022 Accessed: 2026-05-14
Official Etsy announcement that product descriptions would begin influencing search rankings — the starting point for the wave of keyword stuffing in descriptions.
- Etsy Seller Handbook 2017 Accessed: 2026-05-14
Silverman announces Context Specific Ranking (CSR) in September 2018 and signals 'less focus on your titles'. Confirms search and discovery as a strategic priority.
- SEC EDGAR 2017 Accessed: 2026-05-14
Silverman, as incoming CEO, lists 'enhancing search and discovery' as one of four main strategic priorities after taking the role in May 2017.
- VintageMaineia / Etsy Vintage Seller Encyclopedia 2016 Accessed: 2026-05-14
Contains a verbatim quote from an Etsy admin: 'incentives do exist for sellers to stuff our titles for internal search purposes'. Updates from February and June 2018 track the lack of progress. Well-documented seller-side account of the problem.
- CindyLouWho2 Cindy Lou Who 2 2017 Accessed: 2026-05-14
Independent analysis of the Etsy algorithm from one of the first people to reverse-engineer it. Detailed account of 2017 changes including the Blackbird Technologies acquisition and AI search tests.
- CindyLouWho2 Cindy Lou Who 2 2018 Accessed: 2026-05-14
Analysis of a Q&A with an Etsy engineer. Details on Context Specific Ranking and its implementation.
- Modern Retail Julia Waldow 2024 Accessed: 2026-05-14
Includes a direct quote from Etsy's director of search, Andrew Stanton, acknowledging keyword stuffing as a platform problem. Seller accounts of being crowded out by SEO.
- Craft Industry Alliance Abby Glassenberg 2021 Accessed: 2026-05-14
Analysis of Etsy's 'Amazon-ification' under Silverman. Context of investor pressure vs. the 'handmade' mission.
- EcommerceBytes Ina Steiner 2017 Accessed: 2026-05-14
Coverage of Silverman's October 2017 'Keep Commerce Human' mission announcement. A useful counterpoint for the irony of the case study.
11 sources
- Meta / Facebook Newsroom Adam Mosseri 2018 Accessed: 2026-05-14
Official Facebook announcement describing the pivot to 'meaningful social interactions' (January 11, 2018). Primary source for the strategic context of the entire case.
- Washington Post Jeremy B. Merrill, Will Oremus 2021 Accessed: 2026-05-14
Based directly on internal Facebook documents (Facebook Papers). Reveals the 5× weight for all emoji reactions, internal warnings from 2019, and the sequence of algorithm adjustments.
- CNN Business Rachel Metz 2021 Accessed: 2026-05-14
Detailed figures from internal documents: full MSI weight table (Like=1, reactions=5, comments=30). History of algorithm adjustments from 2017 to 2020.
- The Seattle Times / Washington Post Will Oremus, Jeremy B. Merrill 2021 Accessed: 2026-05-14
Contains the exact 2019 document quote: Angry reactions were 'much more frequent' on content categorised as 'civic low quality news, civic misinfo, civic toxicity, health misinfo, and health antivax content'.
- Nieman Journalism Lab Shraddha Chakradhar 2021 Accessed: 2026-05-14
Synthesis of multiple documents. Includes the finding that zeroing out the Angry weight reduced misinformation without reducing engagement.
- The Hill Shirin Ali 2021 Accessed: 2026-05-14
Detailed chronology of algorithm adjustments: 2018 (4×), 2019 (demote mechanism), 2020 (1.5×), September 2020 (0×).
- Wikipedia Accessed: 2026-05-14
Good overview of the leak. Lists the media outlets involved in publishing the Facebook Papers and the chronology of disclosures.
- Washington Post Cat Zakrzewski, Cristiano Lima-Strong 2021 Accessed: 2026-05-14
Original account of Haugen's identity being revealed. Context of her role in the Civic Integrity Team.
- TechDirt Mike Masnick 2021 Accessed: 2026-05-14
Important critical voice: notes that all reactions (not just Angry) received a 5× weight, and that the Washington Post may have created a misleading picture of intentional anger-boosting. Useful for precise description of the mechanism.
- NPR Scott Neuman 2018 Accessed: 2026-05-14
- CNBC Jillian D’Onfro 2018 Accessed: 2026-05-14
9 sources
- NBC News 2014 Accessed: 2026-05-14
Day-of account (December 15, 2014). Confirms A$100 minimum and four times the normal rate. Contains original Uber Sydney tweets.
- TechCrunch Catherine Shu 2014 Accessed: 2026-05-14
Account of Uber's reversal. Notes that Uber had a US-only surge cap policy that did not apply in Australia.
- IBTimes UK Sean Martin 2014 Accessed: 2026-05-14
Official Uber apology. Quote: 'We didn't stop surge pricing immediately. This was the wrong decision.'
- Mic Jordan Valinsky 2014 Accessed: 2026-05-14
Contains Uber's statement on refunds. Context of the absence of an Australian policy at the time of the incident.
- Slate Alison Griswold 2014 Accessed: 2026-05-14
Thoughtful analysis of the dilemma. Argues for Uber-subsidised free rides as a model. Historical context of previous pricing controversies.
- New Republic Danny Vinik 2014 Accessed: 2026-05-14
Covers both sides of the debate (economists vs. public opinion). Good background for the 'incentive model' argument.
- New York Attorney General Letitia James 2014 Accessed: 2026-05-14
Critical context: this agreement was reached five months before Sydney, showing the policy existed in the US but was not applied globally.
- Washington Post Mark Berman 2014 Accessed: 2026-05-14
Coverage of the NY AG agreement and announcement of a US-wide policy.
- The Drive Stephen Edelstein 2018 Accessed: 2026-05-14
Documents the final solution (Global Safety Coordinator team). Lists London 2017 and New York 2016 as subsequent failures after Sydney. Shows it took four years to implement a systematic fix.
11 sources
- The Verge Colin Lecher 2019 Accessed: 2026-05-14
Original ADAPT disclosure (April 25, 2019). Based on NLRB documents. Confirms automatic generation of warnings and terminations without manager involvement. Quotes an Amazon lawyer's letter referencing hundreds of terminations at a single facility.
- MIT Technology Review Charlotte Jee 2019 Accessed: 2026-05-14
Summary of the disclosure. Includes Amazon's full denial. Good source for presenting both sides.
- Vice Lauren Kaori Gurley 2022 Accessed: 2026-05-14
NLRB documents from JFK8. Specific thresholds: warning at 30 min TOT, termination at 120 min. Example of a manager questioning an employee about time spent in the bathroom.
- PressProgress 2021 Accessed: 2026-05-14
Amazon internal wiki. Target set at the 75th percentile — a 'moving goalpost' for workers.
- Foxglove / UK Parliament Briefing 2022 Accessed: 2026-05-14
Compilation of NLRB and Washington State regulator documents. Injury rates 9× higher than the industry average. Formal confirmation of the TOT mechanism.
- Engadget Amrita Khalid 2022 Accessed: 2026-05-14
Original account of the leaked memo (June 2022). Direct quote from the 2021 memo: 'If we continue business as usual, Amazon will deplete the available labor supply in the US network by 2024'.
- The Register Richard Currie 2022 Accessed: 2026-05-14
Confirms 150% annual turnover rate and average tenure of 8 months. Regional details (Phoenix, Inland Empire).
- Fortune Sophie Mellor 2022 Accessed: 2026-05-14
Six levers Amazon was considering. Effect of a $1 wage increase: +7% to the available labour pool.
- CBS News Aimee Picchi 2021 Accessed: 2026-05-14
Official Amazon apology. Worker context: 'tracked down to the second', bathrooms 5–10 minutes from workstations.
- CNBC Annie Palmer 2021 Accessed: 2026-05-14
Source for the letter quotes: 'we need a better vision for how we create value for employees'; 'Earth's Best Employer and Earth's Safest Place to Work'.
- Engadget Richard Lawler 2021 Accessed: 2026-05-14
Announcement of changes to how TOT is measured. Context: published the same day the Washington Post released its injury rate report.
10 sources
- UK Government (GOV.UK) Robert Francis QC 2013 Accessed: 2026-05-14
Primary report. 290 recommendations across four volumes. The foundational source for the entire case study.
- House of Commons Library Tom Powell 2013 Accessed: 2026-05-14
Parliamentary synthesis. Context for both Francis reports (2010 and 2013) and the government's response.
- Hansard / UK Parliament 2013 Accessed: 2026-05-14
Prime Minister and Health Secretary speeches on the day of publication. Source for the 'preoccupation with a narrow set of top-down targets pursued to the exclusion of patient safety' quote.
- Full Fact 2013 Accessed: 2026-05-14
Key source for the nuance around the 400–1,200 figure: explains its origin, why it is contested, and what the Francis report actually stated.
- Wikipedia Accessed: 2026-05-14
Good chronological overview. Contains quotes from the Healthcare Commission and Francis reports regarding the death toll controversy.
- Wikipedia Accessed: 2026-05-14
History of the 4-hour target from 2000/2004. Documents workaround mechanisms (acute assessment units 'outside [A&E] for statistical purposes') and the threshold change from 98% to 95% in 2010.
- Annals of Emergency Medicine Suzanne Mason, Ellen J. Weber, Jennifer Freeman 2011 Accessed: 2026-05-14
Academic study showing a growing proportion of patients leaving A&E in the last 20 minutes before the deadline — empirical evidence of 'teaching to the test'.
- PubMed Central Julie Eatock, Matthew Cooke, Terry P. Young 2017 Accessed: 2026-05-14
Analysis of the long-term effects of the 4-hour target within the wider NHS system.
- Inquests and Inquiries Accessed: 2026-05-14
Solid synthesis of both Francis reports. Quotes on the board's 'obsession' with Foundation Trust status. Lists regulatory consequences including Duty of Candour and Freedom to Speak Up Guardians.
- British Psychological Society Narinder Kapur 2014 Accessed: 2026-05-14
Analysis of the psychological mechanisms behind the culture of fear and silence. Behavioural perspective complementing the regulatory accounts.
6 Chapters
##2 Topics
29 sources
- The Master AlgorithmBasic Books Pedro Domingos 2015
ISBN: 978-0465065707.
Key reference. Domingos divides ML into five 'tribes': symbolists, connectionists, evolutionaries, Bayesians, and analogizers. The closest spiritual match to the explainer's structure, albeit with a slightly different taxonomy.
Review available at: https://www.timeshighereducation.com/books/review-the-master-algorithm-pedro-domingos-allen-lane
- ResearchGate Radhey Shyam, Ria Singh 2021 Accessed: 2026-05-15
Short survey classifying ML into supervised, unsupervised, semi-supervised, and reinforcement learning. Solid basic academic source.
- ACM Tech Talk Pedro Domingos 2015 Accessed: 2026-05-15
Domingos presenting directly to ACM. Companion to the book.
- GeeksforGeeks 2016 Accessed: 2026-05-15
Accessible introduction to ACO with a description of the pheromone mechanism.
- Taylor & Francis Online Duygu Yilmaz Erogl, Umut Akcan 2024 Accessed: 2026-05-15
Confirms Dorigo (1992) as the original ACO source.
- Encyclopedia MDPI Nikola Ivković , Robert Kudelić , Marin Golub 2023 Accessed: 2026-05-15
Encyclopaedic treatment of ACO with description of pheromone trail updates.
- PubMed Central / Multimedia Tools and Applications Sourabh Katoch, Sumit Singh Chauhan, Vijay Kumar 2020 Accessed: 2026-05-15
Peer-reviewed survey covering genetic operators, GA variants, and applications. Good academic overview with a broad literature base.
- IEEE Computer / ACM Digital Library M. Srinivas, Lalit M. Patnaik 1994 Accessed: 2026-05-15
Classic IEEE survey. Describes the analogy to natural processes and the mechanics of genetic algorithms. A historical authority in the field.
- PubMed Central / BioData Mining Timmy Manning, Roy D Sleator, Paul Walsh 2012 Accessed: 2026-05-15
Good introduction from a bioinformatics perspective. Explains fitness function, selection, and crossover using the travelling salesman problem as an example.
- Wikipedia Accessed: 2026-05-15
Unusually solid Wikipedia article here: covers the history of the field (from Farmer and Perelson, 1986), algorithm taxonomy (negative selection, clonal selection, immune networks), and links to primary literature. Use as orientation, not as a sole source.
- SSARSC Pankaj Chaudhary, Kundan Kumar 2018 Accessed: 2026-05-15
Concise survey of AIS models and algorithms from the past 20 years. Describes clonal selection and negative selection mechanisms.
- Springer Encyclopedia of Machine Learning Jon Timmis Accessed: 2026-05-15
Encyclopaedia entry from Springer's ML Encyclopaedia. Authoritative definition and taxonomy of AIS. Key bibliographic reference: de Castro & Timmis (2002).
- arXiv Julie Greensmith, Amanda Whitbrook, Uwe Aickelin Accessed: 2026-05-15
Solid survey covering the field's evolution from 1978 to 2008. Includes diagrams and descriptions of negative selection, clonal selection, and dendritic cell algorithms.
- scikit-learn Accessed: 2026-05-15
Official technical documentation. Covers ID3, C4.5, and CART (Breiman et al., 1984), tree advantages and limitations, and overfitting. Authoritative practical source.
- Classification and Regression TreesWadsworth Breiman, Friedman, Olshen, Stone 1984
ISBN: 978-0412048418.
Canonical original work defining the CART algorithm. Cited by scikit-learn as the primary reference.
- DigitalOcean Shaoni Mukherjee 2025 Accessed: 2026-05-15
Accessible introduction with historical context (Breiman as the creator) and a description of the mechanism.
- Wikipedia Accessed: 2026-05-15
Another unusually strong Wikipedia article: covers the algorithm's history (McCarthy, Knuth & Moore 1975), the minimax mechanism, and concrete figures (node reduction of up to 99.8% for chess with optimal ordering).
- Chess Programming Wiki Accessed: 2026-05-15
Specialist technical reference from the chess engine programming community. Cites the original Knuth & Moore (1975) paper and the history of the algorithm's discovery.
- Cornell University CS312 Accessed: 2026-05-15
Cornell course materials. Reliable explanation of the mechanism with pseudocode and a worked tree example.
- Towards Data Science Can Bayar 2021 Accessed: 2026-05-15
Combines both topics (ML decision trees + alpha-beta) in a single piece. Useful for illustrating the difference between a decision tree in ML and a search tree in game AI.
- IBM Think Josh Schneider, Ian Smalley Accessed: 2026-05-15
Solid overview without marketing noise. Covers the architecture of both processors and their ML use cases.
- AWS Accessed: 2026-05-15
Concise comparison from an infrastructure provider.
- TechTarget Chris Tozzi 2025 Accessed: 2026-05-15
Broader enterprise-context discussion. Includes a description of the GPU's parallel architecture.
- DigitalOcean Shaoni Mukherjee 2024 Accessed: 2026-05-15
Practical tutorial with benchmarks. Shows concretely why GPU wins on ML tasks.
- MIT Press Ian Goodfellow, Yoshua Bengio, Aaron Courville 2016 Accessed: 2026-05-15
Canonical reference. Chapters 5 (ML Basics), 6 (Deep Feedforward Networks), and 9 (Convolutional Networks) cover exactly what the explainer describes. Available free online in HTML.
- NVIDIA Blog Jensen Huang 2016 Accessed: 2026-05-15
Describes the historical AlexNet breakthrough and the role of GPUs in deep learning.
- CRV Science Bryan White 2026 Accessed: 2026-05-15
Detailed history from AlexNet (2012) through subsequent GPU architectures. Confirms the 'same hardware, different application' narrative.
- NVIDIA Accessed: 2026-05-15
Official product page. 16,384 CUDA cores, Ada Lovelace architecture.
- NotebookCheck Anil Ganti 2022 Accessed: 2026-05-15
Trade press coverage at launch. Useful as a historical record.
10 sources
- MIT Press / deeplearningbook.org Ian Goodfellow, Yoshua Bengio, Aaron Courville 2016 Accessed: 2026-05-15
Canonical definition of machine learning, generalisation, and the distinction between training and test error.
Note: this is a chapter of book referenced in the previous explainer.
- IBM Think Dave Bergmann Accessed: 2026-05-15
Concise industry-level treatment of generalisation as the fundamental goal of ML.
- Machine Learning Mastery Jason Brownlee 2019 Accessed: 2026-05-15
Accessible but solid piece on why generalisation is ML's 'superpower' and when it fails.
- Wikipedia Accessed: 2026-05-15
Formal definition of i.i.d. with a section on ML implications. Wikipedia is authoritative here — this is a mathematically precise concept.
- Dive into Deep Learning (d2l.ai) Aston Zhang, Zachary Lipton, Mu Li, Alexander J. Smola 2023 Accessed: 2026-05-15
ISBN: 978-1009389433
Academic online textbook. Detailed description of covariate shift, label shift, and concept shift with concrete examples of deployment failures. Direct counterpart to the 'changed coin' analogy in the explainer.
- Chip Huyen's Blog Chip Huyen 2022 Accessed: 2026-05-15
Practical engineering perspective on distribution shift, from the author of Designing Machine Learning Systems. Describes when and why models stop working in production.
- Causality: Models, Reasoning, and InferenceCambridge University Press Judea Pearl 2009
ISBN: 978-0521895606.
Foundational work. Formal framework distinguishing association, intervention, and counterfactual. The basis of the entire causal inference field.
- The Book of WhyBasic Books Judea Pearl, Dana Mackenzie 2018
ISBN: 978-0465097609.
Popular-science companion to Pearl's Causality. Describes the Ladder of Causation and explains why standard ML operates solely at the level of association.
- Towards Data Science Kaushik Rajan 2026 Accessed: 2026-05-15
Solid industry piece connecting Pearl's Ladder to concrete ML failure examples caused by confusing correlation with causation (including the hormone replacement therapy case).
- lgmoneda.github.io Luis Moneda 2021 Accessed: 2026-05-15
More technical but accessible piece on what spurious correlation means specifically in ML and how it differs from the classical statistics concept.
337 Topics
7 sources
- arXiv / ACM Computing Surveys Ninareh Mehrabi, Fred Morstatter, Nripsuta Saxena, Kristina Lerman, Aram Galstyan 2021 Accessed: 2026-05-15
Canonical survey with over 1,000 citations. Taxonomy of bias sources (historical, representation, measurement, etc.) and definitions of fairness. Peer-reviewed, available via arXiv and ACM DL.
- MDPI Big Data and Cognitive Computing Tiago P. Pagano, Rafael B. Loureiro, Fernanda V. N. Lisboa, Rodrigo M. Peixoto, Guilherme A. S. Guimarães, Gustavo O. R. Cruz, Maira M. Araujo, Lucas L. Santos, Marco A. S. Cruz, Ewerton L. S. Oliveira, Ingrid Winkler, Erick G. S. Nascimento 2023 Accessed: 2026-05-15
PRISMA-compliant systematic review covering 2017–2022. Focuses on bias detection and mitigation techniques. More recent than Mehrabi et al.
- IBM Think Alexandra Jonker , Julie Rogers Accessed: 2026-05-15
Authoritative industry explanation of how historical bias is inherited by ML models, with concrete examples including predictive policing and Oakland data.
- Weapons of Math DestructionCrown Cathy O'Neil 2016
ISBN: 978-0553418811.
Key popular-science work. Introduces WMD (Weapons of Math Destruction) defined by three characteristics: opacity, scalability, and resistance to challenge. Directly relevant to the idea of encoding bias into math to make it appear objective.
- Automating InequalitySt. Martin's Press Virginia Eubanks 2018
ISBN: 978-1250074317.
Companion volume focusing on welfare state systems. Documents how historical bias is amplified by automation and applied at institutional scale.
- Ford Foundation Jenny Toomey, Lori McGlinchey 2016 Accessed: 2026-05-15
Online summary of O'Neil's arguments with her own commentary. Useful for readers without access to the book.
- NIST Reva Schwartz, Apostol Vassilev, Kristen Greene, Lori Perine, Andrew Burt, Patrick Hall 2022 Accessed: 2026-05-15
Official US government document defining AI bias and recommending management approaches. Its existence as a NIST publication reflects the seriousness of the correction problem.
8 sources
- ACM Transactions on Knowledge Discovery from Data Shachar Kaufman, Saharon Rosset, Claudia Perlich, Ori Stitelman 2012 Accessed: 2026-05-15
Canonical academic paper formalising the concept of leakage. Identifies it as one of the ten biggest mistakes in data mining and describes the 'no-time-machine requirement' — the prohibition on using features that would not be available at prediction time.
- Wikipedia Accessed: 2026-05-15
Solid overview with breakdown into feature leakage and row-wise leakage. Useful as a quick taxonomy reference.
- IBM Think Tim Mucci Accessed: 2026-05-15
Authoritative industry explanation of both leakage types (target leakage and train-test contamination) with a description of the normalisation-before-split problem.
- Machine Learning Mastery Jason Brownlee 2020 Accessed: 2026-05-15
Well-written practical tutorial with code. Explains precisely why normalising the full dataset before the train-test split is a mistake and how to fix it.
- Google Research Blog / Science Moritz Hardt 2015 Accessed: 2026-05-15
Seminal paper describing exactly the mechanism of iterative leaderboard overfitting: repeated modification of a model based on holdout set results creates a dependency that invalidates the classical holdout method. Published in Science 349(6248).
- NeurIPS 2019 Rebecca Roelofs, Vaishaal Shankar, Benjamin Recht, Sara Fridovich-Keil, Moritz Hardt, John Miller, Ludwig Schmidt 2019 Accessed: 2026-05-15
First large meta-analysis of test set reuse across 100+ Kaggle competitions. Results are surprisingly positive (little evidence of widespread overfitting), but the methodology confirms the reality of the mechanism.
- mrtz.org (Moritz Hardt's Blog) Moritz Hardt 2015 Accessed: 2026-05-15
Influential post demonstrating how to climb a leaderboard on the Heritage Health Prize without examining the data — purely through algorithmic probing of results. Direct documentation of leaderboard hacking.
- Patterns (Cell Press) Sayash Kapoor, Arvind Narayanan 2023 Accessed: 2026-05-15
Review of 294 published papers across 17 scientific fields affected by data leakage. Demonstrates that the problem is neither abstract nor marginal — it has affected hundreds of peer-reviewed publications.
10 sources
- CNBC / Reuters 2018 Accessed: 2026-05-15
Primary report based on five anonymous internal sources. Describes the 1–5 star rating system, Edinburgh team, ten years of training data, the 'executed'/'captured' problem, and the team's disbandment in early 2017.
- MIT Technology Review Erin Winick 2018 Accessed: 2026-05-15
Concise and reliable summary of the Reuters report.
- ACLU Rachel Goodman 2018 Accessed: 2026-05-15
Confirms details on the verbs 'executed' and 'captured' and the proxy discrimination mechanism. Adds legal context (Title VII).
- Cardozo Law Review Lori Andrews, Hannah Bucher 2022 Accessed: 2026-05-15
Legal article describing hindsight bias in recruitment systems. Directly analyses the Amazon case and proxy features in CVs. References the 74% male historical CV pool figure.
- ACM Transactions on Intelligent Systems and Technology Alessandro Fabris, Nina Baranowska, Matthew J. Dennis, David Graus, Philipp Hacker, Jorge Saldivar, Frederik Zuiderveen Borgesius, Asia J. Biega 2025 Accessed: 2026-05-15
Academic survey formalising proxy features and sensitive attribute proxies in hiring contexts (names, institutions, CV language structure as gender proxies).
- Brookings Kyra Wilson, Aylin Caliskan 2025 Accessed: 2026-05-15
Empirical study confirming that AI systems continue to favour names and linguistic features associated with male and white candidates.
- EIGE (European Institute for Gender Equality) 2020 Accessed: 2026-05-15
European data. Women represent approximately 17% of ICT specialists in the EU. More appropriate than US figures given the algorithm was developed and trained in Scotland.
- Statista / STEM Women Diana Elagina 2025 Accessed: 2026-05-15
UK data: 18% women among IT professionals in 2016, declining to 16% in 2019. Confirms that an 80/20 or 84/16 split is a realistic approximation for technical roles in Scotland in 2014.
- Women in Tech Network 2025 Accessed: 2026-05-15
Accessible summary with historical and current data. Only 22% women among AI and data specialists in the UK.
- ResearchGate Vicki R. McKinney, Darryl D. Wilson, Nita Brooks, Anne O'Leary-Kelly 2008 Accessed: 2026-05-15
10 sources
- ProPublica Angwin, Larson, Mattu, Kirchner 2016 Accessed: 2026-05-15
Original investigative report. Describes the 137 questions, 1–10 scale, and false positive/false negative asymmetry for Black and white defendants in a Broward County, Florida sample.
- ProPublica Julia Angwin, Jeff Larson, Surya Mattu and Lauren Kirchner 2016 Accessed: 2026-05-15
Methodology document. Specifies sample size (6,172–7,214 individuals), recidivism definition (2 years), and data sources. Essential for verifying the figures.
- Science Advances / PubMed Central Julia Dressel, Hany Farid 2018 Accessed: 2026-05-15
Peer-reviewed study showing COMPAS is no more accurate than untrained humans, and that just two variables (age and number of prior convictions) achieve the same accuracy. Strong scientific confirmation of the system's limitations.
- Wisconsin Supreme Court 2016 Accessed: 2026-05-15
Primary court document. Wisconsin Supreme Court (not US Supreme Court). Describes the limitations imposed on COMPAS use and the grounds for rejecting the due process challenge.
- Harvard Law Review 2017 Accessed: 2026-05-15
Legal analysis of the ruling. Covers Loomis's arguments, the court's response, and the precedential significance.
- SCOTUSblog 2017 Accessed: 2026-05-15
Documents the US Supreme Court's denial of certiorari (2017). Important for precision: this is not SCOTUS endorsement of the ruling, only a refusal to intervene.
- Harvard Journal of Law & Technology Ellora Israni 2017 Accessed: 2026-05-15
Academic critique of the ruling. Argues the court misunderstood how the algorithm works and what safeguards would be required.
- UCLA Law Review Andrew Lee Park 2019 Accessed: 2026-05-15
Legal survey of proxy discrimination in recidivism systems. Covers both Loomis and the ProPublica data.
- arXiv / Big Data Alexandra Chouldechova 2017 Accessed: 2026-05-15
Mathematical proof of impossibility: it is not possible to simultaneously satisfy calibration (Northpointe's argument) and equal error rates (ProPublica's argument) when base rates differ between groups. Explains why both sides of the dispute were correct within their own frameworks.
- Northpointe (Equivant) William Dieterich, Christina Mendoza, Tim Brennan 2016 Accessed: 2026-05-15
Vendor response to the ProPublica report. Defends calibration as the correct fairness measure. Useful for illustrating that the debate has two sides with formal arguments.
5 sources
- Science Ziad Obermeyer, Brian Powers, Christine Vogeli, and Sendhil Mullainathan 2019 Accessed: 2026-05-15
Primary research paper. Source for all key figures: 17.7%, 46.5%, and the cost-as-proxy mechanism.
- The Lancet Digital Health 2019 Accessed: 2026-05-15
Peer-reviewed medical commentary. Specifies the $1,800 cost difference and the 200 million patients per year affected figure.
- Scientific American Starre Vartan 2019 Accessed: 2026-05-15
Accessible overview with quotes from Obermeyer. Explains the mechanism of choosing cost as a proxy and the limits of the audit.
- NBC News Quinn Gawronski 2019 Accessed: 2026-05-15
Day-of-publication coverage (November 7, 2019). Includes Optum's response and the $1,800 figure.
- Healthcare Finance News Susan Morse 2019 Accessed: 2026-05-15
Industry coverage. Contains Optum's full statement disputing the researchers' conclusions — useful for the company response section.
9 sources
- CNBC / Re/code Mark Bergen 2015 Accessed: 2026-05-15
Most detailed original account. Contains the 'roughly 90 minutes' detail, describes Yonatan Zunger as 'chief architect of Google+', and the '100% Not OK' quote.
- CBS News Amanda Schupak 2015 Accessed: 2026-05-15
Variant timeline ('about two hours later') with Google's full response. Good second source for the chronology.
- CBC News 2015 Accessed: 2026-05-15
Contains Zunger's tweet in full: 'Holy fuck. G+ CA here. No, this is not how you determine someone's target market. This is 100% Not OK.' Confirms 'appalled and genuinely sorry' wording.
- MIT Technology Review Jacke Snow 2018 Accessed: 2026-05-15
First independent verification after two years. Wired tested 40,000 images and confirmed censorship of 'gorilla', 'chimp', 'chimpanzee', and 'monkey'.
- Gizmodo Sidney Fussell 2018 Accessed: 2026-05-15
Analysis of the censorship mechanism. Specifies the list of blocked terms.
- The New York Times Nico Grant, Kashmir Hill 2023 Accessed: 2026-05-15
Investigative piece eight years on. Tests Google, Apple, Amazon, and Microsoft. Confirms Apple Photos and Microsoft OneDrive have the same problem; Amazon Photos returns broad primate results.
- PetaPixel Pesala Bandara 2023 Accessed: 2026-05-15
Summary of the NYT investigation with technical commentary. Includes Google's quote that 'the benefit does not outweigh the risk of harm'.
- NPR Dustin Jones 2021 Accessed: 2026-05-15
Original account of the 2021 Facebook incident. Confirms the shutdown of the entire topic recommendation system.
- Slate Daniel Politi 2021 Accessed: 2026-05-15
Contains Facebook's official statement: 'We disabled the entire topic recommendation feature as soon as we realized this was happening.'
5 sources
- Nature Machine Intelligence Michael Roberts, Derek Driggs, Matthew Thorpe, Julian Gilbey, Michael Yeung, Stephan Ursprung, Angelica I. Aviles-Rivero, Christian Etmann, Cathal McCague, Lucian Beer, Jonathan R. Weir-McCall, Zhongzhao Teng, Effrossyni Gkrania-Klotsas, James H. F. Rudd, Evis Sala & Carola-Bibiane Schönlieb 2021 Accessed: 2026-05-15
2,212 papers found; 415 after initial screening; 62 after quality screening; zero models suitable for clinical use. Describes patient overlap, Frankenstein datasets, shortcut learning, patient positioning as leakage, and peer review failures.
- University of Cambridge 2021 Accessed: 2026-05-15
Official Cambridge press release with quotes from Roberts and Rudd. Accessible summary of the findings.
- Radiology Business Marty Stempniak 2021 Accessed: 2026-05-15
- Nature Machine Intelligence Robert Geirhos, Jörn-Henrik Jacobsen, Claudio Michaelis, Richard Zemel, Wieland Brendel, Matthias Bethge, Felix A. Wichmann 2020 Accessed: 2026-05-15
Canonical paper defining shortcut learning. Background for the section on patient positioning and technical artefacts as spurious features.
- Patterns (Cell Press) Sayash Kapoor, Arvind Narayanan 2023 Accessed: 2026-05-15
Also relevant here: the review covers cases from medicine and imaging, not only EHR. Complements the Roberts et al. findings with a broader reproducibility crisis perspective.
349 Topics
8 sources
- ResearchGate Christian Agrell, Simen Eldevik, Andreas Hafver, Frank Børre Pedersen 2019 Accessed: 2026-05-15
Academic survey of ML limitations in high-risk environments. Formally describes the constraints of correlation-based models when applied to tail events.
Accessible financial-sector explanation of tail risk. Covers normal distribution vs. fat tails.
- Chip Huyen's Blog Chip Huyen 2022 Accessed: 2026-05-15
Engineering-focused explanation of distribution shift and edge cases in ML. Practical and accessible for technical readers.
- The Black Swan: The Impact of the Highly ImprobableRandom House Nassim Nicholas Taleb 2007
ISBN: 978-1-4000-6351-2.
Original source for the term and the three defining criteria of a Black Swan event.
- Britannica Sanat Pai Raikar Accessed: 2026-05-15
Solid encyclopaedic definition with the history of the term (Juvenal → de Vlamingh → Taleb). Covers the 1697 de Vlamingh sighting in Australia.
- Wikipedia Accessed: 2026-05-15
Well-documented article with citations. Covers the three criteria, history, and critiques of the theory. Useful as a quick reference.
- Wikipedia Accessed: 2026-05-15
Confirms the first European observations: Antonie Caen (1636) and Willem de Vlamingh (1697, Swan River, Western Australia). Cygnus atratus as a species endemic to Australia.
- ConsumerAffairs Alexus Bazen 2026 Accessed: 2026-05-15
Contains all three weight figures: average passenger car (~4,000 lbs / ~2 tonnes, EPA), maximum semi-truck weight (80,000 lbs / 40 tonnes, FMCSA). Data current as of 2024.
6 sources
- PNAS Mikhail Belkin, Daniel Hsu, Siyuan Ma, Soumik Mandal 2019 Accessed: 2026-05-15
Canonical paper on the bias-variance tradeoff and the 'double descent' phenomenon. Formally defines overfitting as fitting to noise at the cost of generalisation.
- Neural Computation Stuart Geman, Elie Bienenstock, René Doursat 1992 Accessed: 2026-05-15
Classic paper introducing the formal bias-variance tradeoff description to ML. Historical source of the concept. Neural Computation 4(1), 1–58.
- IBM Think Fangfang Lee Accessed: 2026-05-15
Solid industry explanation describing overfitting as a model learning 'the noise along with the signal'. Includes a conceptual diagram.
- Google Machine Learning Crash Course Accessed: 2026-05-15
Official Google documentation. Accessible and authoritative, with a concrete operational definition.
- Towards Data Science Dmytro Nikolaiev 2021 Accessed: 2026-05-15
Accessible explanation with visualisations and analogies. Good for illustrating the mechanism to non-specialist readers.
- YouTube / 3Blue1Brown 3Blue1Brown 2017 Accessed: 2026-05-15
Widely-viewed popular explanation of how neural networks work. Overfitting appears as a natural context of the learning process.
7 sources
- Hachette Books Tyler Vigen 2015 Accessed: 2026-05-15
ISBN: 978-0-316-33943-8.
Original source for both examples. Book and website contain hundreds of auto-generated spurious correlations.
- tylervigen.com Tyler Vigen Accessed: 2026-05-15
Tyler Vigen's website contains hundreds of auto-generated spurious correlations.
Auto-generated correlation from public datasets.
I deliberately chose absurd examples to illustrate the mechanism, not as verified empirical claims.
- tylervigen.com Tyler Vigen Accessed: 2026-05-15
Tyler Vigen's website contains hundreds of auto-generated spurious correlations.
Auto-generated correlation from public datasets.
I deliberately chose absurd examples to illustrate the mechanism, not as verified empirical claims.
- The Book of Why: The New Science of Cause and EffectBasic Books Judea Pearl, Dana Mackenzie 2018
ISBN: 978-0-465-09760-9.
Canonical source for the distinction between correlation and causation. Formally defines spurious correlation as correlation arising from a confounder or chance.
- Econometrica C. W. J. Granger 1969 Accessed: 2026-05-15
Classic paper defining Granger causality. Historical context for the spurious correlations problem in time-series analysis.
- Towards Data Science Celia Banks 2024 Accessed: 2026-05-15
Accessible explanation of the mechanism .
- Investigative Economics Llewellyn Jones 2021 Accessed: 2026-05-15
Critical analysis of Vigen's data. Documents that some datasets have disappeared and correlations may be based on outdated figures. Useful as an honest caveat to the mozzarella example.
6 sources
- DeepMind Blog Victoria Krakovna, Jonathan Uesato, Vladimir Mikulik, Matthew Rahtz, Tom Everitt, Ramana Kumar, Zac Kenton, Jan Leike, Shane Legg 2020 Accessed: 2026-05-15
Canonical DeepMind article defining specification gaming. Includes examples: Lego stacking, boat racing, social media engagement.
- Victoria Krakovna's Blog Victoria Krakovna 2018 Accessed: 2026-05-15
Extensive, regularly updated list of documented reward hacking cases with literature references. Excellent source for the mini-case sections.
- arXiv Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dan Mané 2016 Accessed: 2026-05-15
Seminal OpenAI/Google Brain paper introducing reward hacking as a formal AI safety problem.
- arXiv David Manheim, Scott Garrabrant 2018 Accessed: 2026-05-15
Academic taxonomy of Goodhart's Law variants. Connects reward hacking to the broader class of objective specification problems.
- Lil'Log (Lilian Weng) Lilian Weng 2024 Accessed: 2026-05-15
Technical but accessible survey with examples. The author is Head of Safety at OpenAI, lending the piece practical authority.
- Wikipedia Accessed: 2026-05-15
Solid introduction with examples and links to primary literature.
14 sources
- NHTSA 2022 Accessed: 2026-05-15
Primary regulatory document. 354 complaints, 416,000 vehicles (2021–2022 Model 3 and Model Y), description of the braking mechanism.
- Automotive News Audrey LaForest 2022 Accessed: 2026-05-15
Day-of-opening industry coverage of the preliminary evaluation.
- Carscoops Sebastien Bell 2022 Accessed: 2026-05-15
Detailed account with context: 354 complaints over 9 months, description of the braking events.
- The Drive Rob Stumpf 2021 Accessed: 2026-05-15
First account after the decision was announced. Cites official Tesla documentation.
- Washington Post 2023 Accessed: 2026-05-15
Tesla engineers were reportedly alarmed by Musk's decision and contacted a former executive to dissuade him. Musk 'was unconvinced and overruled his engineers.' Directly connects radar removal to increased accidents and near-misses. Based on conversations with nearly twelve former employees. Note: article may be paywalled.
- Wikipedia Accessed: 2026-05-15
Well-documented history of the hardware decisions. Includes the quote that Musk 'overruled' engineers who warned against removing radar.
- Carscoops Brad Anderson 2023 Accessed: 2026-05-15
Good secondary account. Quotes former NHTSA adviser Missy Cummings: radar served as a 'sensor fusion way to check if there is a problem', and removing it was 'a big part of' the phantom braking issues.
- Electrek Fred Lambert 2023 Accessed: 2026-05-15
Adds nuance: Musk was frustrated with the quality of available radars, not arguing radar was unnecessary in principle. Useful for avoiding oversimplification.
- Sustainable Business Magazine Accessed: 2026-05-15
Industry assessment comparing camera and LiDAR approaches. Not a primary source — a trade-level evaluation of the technology trade-offs and cost differences.
- Arcadian.ai Olivia Campbell 2025 Accessed: 2026-05-15
Industry-level comparison. Not a primary source — a trade assessment of the two sensor approaches.
- MotorSafety.org Bojan Popic 2022 Accessed: 2026-05-15
Opening of the PE for 1.73 million Hondas (2018–19 Accord, 2017–19 CR-V). 278 complaints, description of the CMBS mechanism.
- Carscoops Chris Chilton 2024 Accessed: 2026-05-15
Expansion of the investigation to ~3 million vehicles. 47 accidents, 93 injuries. Context for the scale of the problem.
- Consumer Reports Chris Chilton 2019 Accessed: 2026-05-15
35,390 vehicles (2019–2020 Mazda3), Smart Brake Support (SBS), false detection mechanism, no injuries reported.
- Mazda North American Operations 2019 Accessed: 2026-05-15
Official manufacturer statement. Confirms 'incorrect programming of the SBS control software'.
8 sources
- SEC EDGAR Rich Barton, Allen Parker 2021 Accessed: 2026-05-15
Primary regulatory document. Contains the exact Barton quote, $304M Q3 write-down, Q4 forecast, and 25% workforce reduction.
- GeekWire Taylor Soper 2021 Accessed: 2026-05-15
2,000 layoffs (25% of workforce), write-down over $500M, 9,790 homes in Q3 inventory.
- The Real Deal 2022 Accessed: 2026-05-15
Total Zillow Offers loss for 2021: $880M. Context: $6B segment revenue, zero profit.
- CBS News Rachel Layne 2021 Accessed: 2026-05-15
Data on contracted and inventory homes (~18,000 combined across various legal states). Source for extrapolation.
- CNN Anna Bahney 2022 Accessed: 2026-05-15
Case-Shiller National Index: +18.8% in 2021, the largest annual gain in the index's 34-year history. Confirms the 'nearly 20%' figure.
- Stanford Graduate School of Business Amit Seru, Greg Buchak 2021 Accessed: 2026-05-15
Academic analysis from Stanford. Describes the structural problem of iBuying, the speed-vs-accuracy tradeoff, and why the model could not function under low-liquidity conditions.
- GeekWire John Cook 2021 Accessed: 2026-05-15
Includes MoxiWorks CEO quote: 'all the AI and machine learning in the world isn't yet up to the task.' Notes the Black Swan framing used as a corporate excuse. Broader AI-in-business context.
- Towards Data Science Florent Buisson 2021 Accessed: 2026-05-15
Good explanation of the Winner's Curse mechanism in the Zillow context. Describes the feedback loop and structural reasons why iBuying was prone to systematic overpaying.
3 sources
- arXiv / KDD 2016 Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin 2016 Accessed: 2026-05-15
Original paper introducing the Husky vs. Wolf experiment and the LIME explanation method.
- ACM Digital Library Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin 2016 Accessed: 2026-05-15
Official conference proceedings version. Contains Table 2 'Husky vs Wolf experiment results'.
- Analytics Magazine (INFORMS) Charles Simon 2020 Accessed: 2026-05-15
Accessible discussion of the case with illustration. Useful industry secondary source for non-academic readers.
5 sources
- Radiology Mark O. Wielpütz, 2022 Accessed: 2026-05-15
Editorial commenting on position bias in AI for chest X-rays. The title directly inspired the case study title.
- PubMed / Critical Care Medicine Johannes Rueckel, Wolfgang G. Kunz, Boj F. Hoppe, Maximilian Patzig, Mike Notohamiprodjo, Felix G. Meinel, Clemens C. Cyran, Michael Ingrisch, Jens Ricke, Bastian O. Sabel 2020 Accessed: 2026-05-15
Original study of AI applied to supine chest X-rays. Foundational context for the position bias problem.
- PLOS Medicine John R. Zech, Marcus A. Badgeley, Manway Liu, Anthony B. Costa, Joseph J. Titano, Eric Karl Oermann 2018 Accessed: 2026-05-15
Canonical study documenting hospital-specific shortcuts. Models learn institutional correlations rather than pathology.
- arXiv Vincent Olesen, Nina Weng, Aasa Feragen, and Eike Petersen 2024 Accessed: 2026-05-15
Documents chest drain as a shortcut in pneumothorax classification and ECG cables in atelectasis detection. Confirms the mechanism described in the text.
- Nature Machine Intelligence Robert Geirhos, Jörn-Henrik Jacobsen, Claudio Michaelis, Richard Zemel, Wieland Brendel, Matthias Bethge & Felix A. Wichmann 2020 Accessed: 2026-05-15
Canonical paper defining shortcut learning. Provides the broader academic context for the mechanism in this case study.
8 sources
- SIGBOVIK 2013 Tom Murphy VII 2013 Accessed: 2026-05-15
Original paper. Contains the description of PlayFun's Tetris behaviour and the pausing mechanism.
- TechCrunch John Biggs 2013 Accessed: 2026-05-15
First media account. Quotes the Tetris pausing behaviour.
- Hackaday Mike Szczys 2013 Accessed: 2026-05-15
Contains the direct quote from the paper on pausing: 'Death is imminent, so playfun pauses the game shortly after this and then doesn't unpause it.'
- OpenAI Blog Jack Clark, Dario Amodei 2016 Accessed: 2026-05-15
Primary OpenAI blog post on the boat racing agent. Describes the mechanism, the targets, the fire, and the 20% score above human players.
- Wikipedia Accessed: 2026-05-15
Secondary reference with historical context and links to primary works.
- Wikipedia Accessed: 2026-05-15
Confirms the drives/pain/pleasure mechanism, Norn biochemistry, Creatures 2 release (1998), and Steve Grand's role.
- Alan Zucconi's Blog Alan Zucconi 2020 Accessed: 2026-05-15
Technical explanation of the biochemistry and reward/punishment mechanism in Creatures. Good accessible source for the mechanism described in the case study.
- Creation: Life and How to Make ItWeidenfeld & Nicolson Steve Grand 2000
ISBN: 978-0-7538-1254-0.
Grand's own account of the Creatures project. Primary source for his intentions and description of emergent Norn behaviour.
357 Topics
11 sources
- arXiv / KDD 2016 Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin 2016 Accessed: 2026-05-15
Original LIME paper. Also cited in Chapter 34 (Husky vs. Wolf experiment).
- arXiv / NeurIPS 2017 Scott M. Lundberg, Su-In Lee 2017 Accessed: 2026-05-15
Original SHAP paper. Formally connects Shapley values to ML interpretability and shows the relationship with LIME.
- Cambridge University Press Lloyd Shapley 1953 Accessed: 2026-05-15
In: Contributions to the Theory of Games II, pp. 307–317.
Original Shapley (1953) paper. Source for the φᵢ(v) formulation and the four axioms. Cite as a book chapter.
- christophm.github.io Christoph Molnar 2025 Accessed: 2026-05-15
Accessible explanation of SHAP, Shapley values, and their history. Open online book widely cited in academic literature.
ISBN: 978-3911578035
Note: this is a chapter of a book that is under copyright, but the chapter itself is freely available online and can be cited directly. I strongly recommend purchasing it.
- arXiv / Communications of the ACM Mengnan Du, Ninghao Liu, Xia Hu 2019 Accessed: 2026-05-15
Formally defines local vs. global interpretability and describes the accuracy-interpretability tradeoff across model classes (decision trees vs. neural networks).
- arXiv / Information Fusion Alejandro Barredo Arrieta, Natalia Díaz-Rodríguez, Javier Del Ser, Adrien Bennetot, Siham Tabik, Alberto Barbado, Salvador García, Sergio Gil-López, Daniel Molina, Richard Benjamins, Raja Chatila, Francisco Herrera 2019 Accessed: 2026-05-15
Canonical XAI survey cited by hundreds of papers. Defines the interpretability-accuracy tradeoff. Key academic background for the 'dirty secret' described in the text.
- arXiv / AI Magazine Bryce Goodman, Seth Flaxman 2016 Accessed: 2026-05-15
Describes the regulatory dimension of the 'why me?' question — the right to explanation for algorithmic decisions under GDPR. Legal background for the loan officer scenario.
- CFPB 2022 Accessed: 2026-05-15
Key US regulatory document. CFPB explicitly states that creditors 'cannot lawfully use technologies in their decision-making processes if using them means that they are unable to provide these required explanations.'
- FTC Andrew Smith, 2020 Accessed: 2026-05-15
FTC guidance confirming that FCRA (1970) and ECOA (1974) already require explanation of algorithm-based decisions — the right to explanation in the US predates ML by 50 years.
- Cyberspace Administration of China / Lexology Jingyuan Shi, Yuchen Lai, Yanyu Lai 2022 Accessed: 2026-05-15
China's first comprehensive algorithm regulations (effective 1 March 2022). Require disclosure of 'basic principles, purposes and mechanics' of recommendation algorithms. The Chinese-market counterpart to GDPR Article 22.
- Carnegie Endowment for International Peace Matt Sheehan, Sharon Du 2022 Accessed: 2026-05-15
Analysis of China's algorithm registry. Describes the episode where ByteDance had to explain its algorithm to CAC officials 'using metaphors and simplified language' — strong narrative context for the interpretability limits section.
6 sources
- Alim.org Doug Laney 2001 Accessed: 2026-05-15
Original paper introducing the 3V definition (Volume, Velocity, Variety). Meta Group Research Note 949, 6 February 2001. Gartner acquired Meta Group in 2005.
Note: the original Gartner blog article does not exist anymore (https://www.gartner.com/en/articles/strategic-predictions-for-2026). Alternative source provided.
- Big Data: A Revolution That Will Transform How We Live, Work, and ThinkHoughton Mifflin Harcourt Viktor Mayer-Schönberger, Kenneth Cukier 2013
ISBN: 978-0-544-00269-2.
Most influential popular-science book on Big Data. Responsible for bringing the term into the mainstream around 2013–2015.
- Big Data: Principles and Best Practices of Scalable Realtime Data SystemsManning Nathan Marz, James Warren 2015
ISBN: 978-1-617-29034-3.
Canonical technical reference. Describes Lambda architecture (batch + speed layer), distributed systems, and failure modes. Standard industry reading.
- Google Research / SOSP 2003 Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung 2003 Accessed: 2026-05-15
Original paper describing how to scale storage across thousands of servers, treating disk failures as the norm rather than the exception. Foundation of the Big Data infrastructure era.
- Google Research / OSDI 2004 Jeffrey Dean, Sanjay Ghemawat 2004 Accessed: 2026-05-15
Paper that gave rise to Hadoop and the entire Big Data ecosystem. Describes distributed computation on clusters subject to partial failure. Canonical source for 'hundreds or thousands of servers working in parallel'.
- Wikipedia Accessed: 2026-05-15
Classic list of eight false assumptions about distributed networks (the network is reliable, latency is zero, etc.). Origin: Sun Microsystems, 1994–1997. The original has no single canonical URL; Wikipedia reproduces the full list with history.
8 sources
- STAT News Casey Ross, Ike Swetlitz 2018 Accessed: 2026-05-15
Original investigation based on internal IBM documents (deputy chief health officer slide decks). Describes synthetic training data, the hemorrhaging patient case, and deviations from NCCN guidelines.
- STAT News Casey Ross, Ike Swetlitz 2018 Accessed: 2026-05-15
Archived version with the specific case description: '65-year-old man with newly diagnosed lung cancer and evidence of severe bleeding'.
- Boston Globe Casey Ross, Ike Swetlitz 2018 Accessed: 2026-05-15
Independent account using the same documents. Confirms the 2012 partnership date, sales in Asia before training was complete, and NCCN deviations.
- ASH Clinical News 2018 Accessed: 2026-05-15
Account from the medical community (American Society of Hematology). Quotes Dr. David Gorski: 'Watson is basically the MSKCC way, which might or might not be the right way in every case.'
- Medscape Roxanne Nelson 2018 Accessed: 2026-05-15
Clinical perspective. Describes the bias arising from MSK-centric training and generalisation problems across different patient populations.
- BBC News 2014 Accessed: 2026-05-15
Historical context for Watson and its commercialisation. Background for the marketing hype section.
- CIO.de Joab Jackson 2015 Accessed: 2026-05-15
Confirms Watson Health deployment in Africa and the Middle East.
- Healthcare Digital 2026 Accessed: 2026-05-15
Comprehensive retrospective. Covers deployments in Thailand, Korea, and India, and problems with local clinical guidelines.
4 sources
- The New York Times Magazine Charles Duhigg 2012 Accessed: 2026-05-15
Primary journalistic source for the entire case. Describes Andrew Pole, the 25 products, the pregnancy prediction score, the father-and-daughter anecdote, and the coupon shuffling mechanism.
- The Power of Habit: Why We Do What We Do in Life and BusinessRandom House Charles Duhigg 2012
ISBN: 978-1-4000-6928-6.
Contains an expanded version of the same case. The book brought the story to mainstream attention.
- Machine Learning Times / KDnuggets Eric Siegel 2014 Accessed: 2026-05-15
Key debunking article. Argues the causal link between the algorithm and any specific pregnancy 'has essentially been debunked.' Essential for bibliographic honesty regarding the limits of the anecdote.
- Forbes Kashmir Hill 2012 Accessed: 2026-05-15
The article whose headline went viral and cemented the simplified version of the story in public consciousness. Context for how the anecdote took on a life of its own.
10 sources
- Wikipedia Accessed: 2026-05-15
Covers founders (Girouard, Gu, Counselman), company history, and the variables used (GPA, college attended, area of study).
- CNBC Uptin Saiidi 2015 Accessed: 2026-05-15
Original trade coverage. Co-founder Paul Gu quote: 'These variables are extremely predictive on whether someone is likely to pay a loan.' Also contains early discrimination concerns.
- Harvard Digital Innovation Leonardo Leal 2019 Accessed: 2026-05-15
Model analysis. Confirms behavioural data and GPA trajectory as scoring elements.
- CFPB Patrice Alexander Ficklin, Tom Pahl, Paul Watkins 2020 Accessed: 2026-05-15
Regulatory document describing ECOA requirements for AI models and regulatory flexibility. Context for the Explainability Gap.
Note: the original CFPB blog post is no longer available: "The CFPB blog was archived on May 14, 2026."
- CFPB 2022 Accessed: 2026-05-15
'Creditors cannot lawfully use technologies if they cannot provide required explanations.' Also cited in the Interpretability explainer.
- Grant Thornton Tariq A. Mirza, Henry Lau 2023 Accessed: 2026-05-15
Describes Upstart's No-Action Letter (2016/2017), its expiry, and the shift in CFPB's approach between 2020 and 2022. Good context for the evolution of the Upstart–CFPB relationship.
- Consumer Financial Services Law Monitor Mark Furletti, Lori Sommerfield, Chris Willis, Lane Page, David N. Anthony 2025 Accessed: 2026-05-15
Directly references models with 'more than 1,000' variables as a fair lending risk. Regulatory context for proxy discrimination.
- debt.org Bents Dulcio 2024 Accessed: 2026-05-15
One of the more specific citations for the variable count: 'Upstart AI model looks at 1,500 variables based on data from 4.4 million repaid loans.'
- Financer.com Joe Chappius 2026 Accessed: 2026-05-15
'Upstart uses a unique AI system that analyzes over 1,500 variables to assess borrower risk.'
- X (Twitter) HenryInvests 2024 Accessed: 2026-05-15
'Upstart uses over 1,600 variables when evaluating one's credit.' The only source citing 1,600 specifically. Note: this is an investor analysis thread, not an official Upstart statement.
6 sources
- Financial Times Laura Noonan 2017 Accessed: 2026-05-15
Original report on the European rollout of LOXM (31 July 2017). Not accessible without a paywall, but cited by all secondary sources. Financial Times, 31 July 2017.
- Informa Connect / QuantMinds 2018 Accessed: 2026-05-15
Conference report from QuantMinds 2018. Contains statements from Vaslav Glukhov (Head of EMEA e-Trading Quantitative Research, JPMorgan) on the reinforcement learning mechanism, reward function, and system objectives.
- FIA (Futures Industry Association) Charles P. Wallace 2018 Accessed: 2026-05-15
Quotes Glukhov: 'How LOXM is rewarded for being efficient in the market, and how the efficiency of the agency is defined, is stated in the reward function.' Confirms reinforcement learning (not specifically 'deep RL').
- Finance Magnates Jeff Patterson 2017 Accessed: 2026-05-15
Describes the global rollout plan. Confirms training on 'billions of past trades, both real and simulated'.
- SEC / CFTC Andrei Kirilenko, Albert S. Kyle, Mehrdad Samadi, Tugkan Tuzun 2010 Accessed: 2026-05-15
Official regulatory report. Describes how unrelated trading algorithms activating across different parts of the market can cascade into a systemic event. Also cited in Chapter 31 (Flash Crash case study).
- PubMed Central / Social Studies of Science Bo Hee Min, Christian Borch 2022 Accessed: 2026-05-15
Academic analysis of the Flash Crash as a 'normal accident' arising from complex interactions between algorithms. Solid scientific background for the systemic risk section.
6 sources
- DeepMind Accessed: 2026-05-15
Primary source for Move 37: 'a move that had a 1 in 10,000 chance of being used.' Also describes the match against Lee Sedol (18 world titles, AlphaGo winning 4–1).
- Wikipedia Accessed: 2026-05-15
Detailed match account with professional Go players' commentary on Move 37 and Lee Sedol's reaction.
- YouTube / DeepMind Greg Kohs (dir.) 2017 Accessed: 2026-05-15
Feature documentary about the match. Contains original footage of Move 37 and the commentators' real-time reactions.
- Nature David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel & Demis Hassabis 2016 Accessed: 2026-05-15
Original AlphaGo paper describing the architecture (deep neural networks combined with Monte Carlo tree search).
- Nature David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel & Demis Hassabis 2017 Accessed: 2026-05-15
AlphaGo Zero paper. Quote: 'Starting tabula rasa, our new program AlphaGo Zero achieved superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.'
- DeepMind Blog David Silver, Demis Hassabis 2017 Accessed: 2026-05-15
Official DeepMind blog post. Accessible explanation of the self-play mechanism and the differences from the previous version.
365 Topics
7 sources
- MIT Press / ResearchGate Joaquin Quionero-Candela, Masashi Sugiyama, Anton Schwaighofer, Neil D. Lawrence 2009 Accessed: 2026-05-15
Canonical academic source. Formalises covariate shift, label shift, and concept shift as distinct mechanisms. Cited by hundreds of papers as the reference taxonomy.
- Chip Huyen's Blog Chip Huyen 2022 Accessed: 2026-05-15
Accessible engineering explanation. Covers population shift, temporal shift, and concept drift with production examples. Author is Senior Staff Engineer at NVIDIA and author of Designing Machine Learning Systems (O'Reilly). Also cited in Chapters 34 and 36.
- ACM Computing Surveys João Gama, Indrė Žliobaitė, Albert Bifet, Mykola Pechenizkiy, Abdelhamid Bouchachia 2014 Accessed: 2026-05-15
Canonical concept drift survey. Classifies drift types and adaptation methods. Most cited academic source for this specific category.
- paulgraham.com Paul Graham 2002 Accessed: 2026-05-15
Original essay that launched the era of Bayesian spam filters. Describes 'Viagra' as a strong spam signal. Historical source for the 2003 context.
- paulgraham.com Paul Graham 2003 Accessed: 2026-05-15
Sequel essay. Describes spammer counter-evolution (keyword obfuscation) as an early example of adversarial concept drift. Background for the V1agra/Vi@gra examples.
- BMJ Laure Wynants, Ben Van Calster, Gary S. Collins, Richard D. Riley, Georg Heinze, Ewoud Schuit, Elena Albu, Banafsheh Arshi, Vanesa Bellou, Marc M. J. Bonten, Darren L. Dahly, Johanna A. Damen, Thomas P. A. Debray, Valentijn M. T. de Jong, Maarten De Vos, Paula Dhiman, Joie Ensor, Shan Gao, Maria C. Haller, Michael O. Harhay, Liesbet Henckaerts, Pauline Heus, Jeroen Hoogland, Mohammed Hudda, Kevin Jenniskens, Michael Kammer, Nina Kreuzberger, Anna Lohmann, Brooke Levis, Kim Luijken, Jie Ma, Glen P. Martin, David J. McLernon, Constanza L. Andaur Navarro, Johannes B. Reitsma, Jamie C. Sergeant, Chunhu Shi, Nicole Skoetz, Luc J. M. Smits, Kym I. E. Snell, Matthew Sperrin, René Spijker, Ewout W. Steyerberg, Toshihiko Takada, Ioanna Tzoulaki, Sander M. J. van Kuijk, Bas C. T. van Bussel, Iwan C. C. van der Horst, Kelly Reeve, Florien S. van Royen, Jan Y. Verbakel, Christine Wallisch, Jack Wilkinson, Robert Wolff, Lotty Hooft, Karel G. M. Moons, Maarten van Smeden 2020 Accessed: 2026-05-15
Review of 232 COVID prediction models. Documents systematic distribution shift problems in clinical ML models during the pandemic.
- PMLR Bret Nestor, Matthew B. A. McDermott, Willie Boag, Gabriela Berner, Tristan Naumann, Michael C. Hughes, Anna Goldenberg, Marzyeh Ghassemi 2019 Accessed: 2026-05-15
Documents temporal shift in medical records data. Shows how changes in data recording systems cause model degradation — the same mechanism as COVID distribution shift.
6 sources
- ACLU 2020 Accessed: 2026-05-15
Primary ACLU press release. Describes the arrest, 30 hours detained, daughters aged 2 and 5, and the Shinola store context.
- NPR Bobby Allyn 2020 Accessed: 2026-05-15
Contains quotes from Williams and the detective. Confirms DataWorks Plus as the system vendor and the alibi details.
- The New York Times Kashmir Hill 2020 Accessed: 2026-05-15
Original article that brought the case to public attention. Paywalled, but cited by all secondary sources.
- MIT Technology Review Tate Ryan-Mosley 2021 Accessed: 2026-05-15
Covers the lawsuit. Confirms DataWorks Plus, the photo lineup mechanism, and the failure to verify the alibi before arrest.
- Michigan Law / Law Quadrangle Sharon Morioka 2024 Accessed: 2026-05-15
Describes the 2024 settlement. Confirms the alibi (Facebook live stream) and that Williams appeared ninth on the match list.
- FAccT / PMLR Joy Buolamwini, Timnit Gebru 2018 Accessed: 2026-05-15
Canonical study documenting higher error rates for darker skin tones in commercial facial recognition systems. Academic background for the mechanism that led to Williams's arrest. Also cited in the Amazon Rekognition case.
6 sources
- ACLU Jacob Snow 2018 Accessed: 2026-05-15
Primary source. Describes the methodology ($12.33, 25,000 mugshots, default settings), results (28 false matches, 40% POC), the John Lewis inclusion, and the moratorium demand.
- ACLU 2018 Accessed: 2026-05-15
ACLU response to Amazon's defence. Contains the timeline of Amazon's threshold changes (80% → 95% → 99% within 48 hours) and the 'five stages of grief' quote.
- Slate Aaron Mak 2018 Accessed: 2026-05-15
Quotes Amazon's full statement on thresholds. Source for the 'acceptable for hot dogs, not for law enforcement' framing.
- Nextgov / FCW Dave Gershgorn 2018 Accessed: 2026-05-15
Industry coverage. Confirms the default settings and AWS's response.
- FAccT / PMLR Joy Buolamwini, Timnit Gebru 2018 Accessed: 2026-05-15
Canonical study on higher error rates for darker skin tones in commercial systems. Also cited in the Robert Williams case.
- NIST Patrick Grother, Mei Ngan, Kayee Hanaoka 2019 Accessed: 2026-05-15
Official US government report. Documents systematically higher false positive rates for African American and Asian faces in commercial systems. Strongest institutional confirmation of the bias mechanism.
5 sources
- PNAS Allison Koenecke, Andrew Nam, Emily Lake, Sharad Goel 2020 Accessed: 2026-05-15
Primary research paper. 42 white and 73 Black speakers, five cities, 19.8 hours of audio, WER 0.35 vs. 0.19. Source for all key figures. Also cited in the UK Parliaments case.
- Stanford Report Tom Abate, Ker Than 2020 Accessed: 2026-05-15
Official Stanford press release. Accessible summary with the authors' quote on potential career and life consequences.
- The Register Katyanna Quach 2020 Accessed: 2026-05-15
Industry coverage with per-company data (Apple worst, Microsoft best) and explanation of the acoustic mechanism.
- Built In Jeff Link 2020 Accessed: 2026-05-15
Accessible analysis. Author quote: 'We think the disparity is largely due to the lack of diverse training data.' Covers AAVE and historical context.
- PubMed Central / NIH Zion Mengesha, Courtney Heldreth, Michal Lahav, Juliana Sublewski, Elyse Tuennerman 2021 Accessed: 2026-05-15
Study on the psychological and experiential impact of ASR errors on African Americans. Useful supplement for the burnout and frustration section.
6 sources
- Inter-Parliamentary Union 2024 Accessed: 2026-05-15
Official IPU documentation. Source for the quote: 'If the AI system cannot recognize an MP's speech owing to an uncommon dialect or the use of jargon, it highlights the section for manual transcription.'
- arXiv Melissa Torgbi1, Andrew Clayman, Jordan J. Spe ight, Harish Tayyar Madabushi 2025 Accessed: 2026-05-15
Study of ASR for Scottish dialects in public legal and housing services. Confirms systematically higher error rates for Scottish accents even after controlling for other variables. Recommends accent-specific fine-tuning. University of Bath / Wyser.
- The University of Edinburgh's Research Archive Nina Markl 2022 Accessed: 2026-05-15
Systematic analysis of the impact of regional British accents on ASR accuracy. Scottish, Welsh, and Northern English identified as particularly problematic. Cited by the Bath/Wyser paper as a reference point.
- PNAS Allison Koenecke, Andrew Nam, Emily Lake, Sharad Goel 2020 Accessed: 2026-05-15
Also cited in the Stanford Speech Gap case. The authors explicitly suggest the same mechanism applies to 'regional and nonnative-English accents' — a direct bridge between the two cases.
- BusinessBusStop Accessed: 2026-05-15
Industry benchmark summary. Confirms that ASR models are trained 'predominantly on American or standard Southern British English' and that Scottish, Welsh, and Northern English accents are 'particularly prone to misrecognition'.
- YouTube The Scottish Comedy Channel 2014 Accessed: 2026-05-15
Satirical illustration of the problem in the finest British tradition. Not a source in any academic sense. If you've read this far into the bibliography, you've earned it.
##1 Topic
5 sources
- Mozilla Foundation 2021 Accessed: 2026-05-15
Largest crowdsourced study of the YouTube algorithm to date (37,000 users, 3,362 'regrettable videos'). Key figure: videos causing regret received 70% more daily views than others. Strongest empirical confirmation of outrage as a quality signal.
- TechCrunch Natasha Lomas 2021 Accessed: 2026-05-15
Lighter coverage of the Mozilla study. Contains the 70% figure and description of the mechanism.
- arXiv Hussam Habib, Rishab Nithyanand 2025 Accessed: 2026-05-15
Empirical study showing the algorithm systematically recommends content that elicits negative emotions. Cites Brady et al. (2017) on moral outrage spreading faster and further than neutral content.
- PNAS Muhammad Haroon, Magdalena Wojcieszak, Anshuman Chhabra, Zubair Shafiq 2023 Accessed: 2026-05-15
Largest academic study of the YouTube algorithm. Describes the political personalisation mechanism and feedback loops.
- PNAS Nexus Xudong Yu , Muhammad Haroon , Ericka Menchen-Trevino , Magdalena Wojcieszak 2024 Accessed: 2026-05-15
Describes the algorithm's 'interest bias' — catering to outrage and clickbait at the cost of diversity. Confirms the mechanism described in the text.
6 Chapters
##3 Topics
1 sources
- Responsible AI Collaborative Accessed: 2026-05-16
Database of AI-related incidents maintained by the Responsible AI Collaborative. Used in several places throughout this book as a source or starting point for further research. Recommended independently of any specific selection made here.
5 sources
- arXiv / NeurIPS 2017 Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin 2017 Accessed: 2026-05-16
Original paper introducing the Transformer architecture. Canonical source for the attention and token sections. Also cited in The Transformer explainer.
- karpathy.github.io Andrej Karpathy 2015 Accessed: 2026-05-16
Classic popular-science description of next-token prediction as a mechanism. Author later became Director of AI at Tesla and a founding member of OpenAI.
- arXiv / Stanford Nelson F. Liu, Kevin Lin, John Hewitt, Ashwin Paranjape, Michele Bevilacqua, Fabio Petroni, Percy Liang 2023 Accessed: 2026-05-16
Canonical study on accuracy degradation in long contexts. Confirms the mechanism described in the Frodo/Tatooine example.
- TokenMix 2026 Accessed: 2026-05-16
Current (April 2026) overview of context window sizes. Confirms GPT-4o 128k, Claude 200k, Gemini 2.5 Pro 1M–2M tokens, and the 10–25% lost-in-the-middle degradation range.
- arXiv / TMLR Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus 2022 Accessed: 2026-05-16
Canonical academic source for the emergence section — the phenomenon of capabilities appearing unpredictably as models are scaled.
6 sources
- arXiv / NeurIPS 2017 Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin 2017 Accessed: 2026-05-16
Original Google Brain paper introducing the Transformer architecture, self-attention, multi-head attention, and Q/K/V. 8 authors, cited over 100,000 times. Also cited in the Next-Token Prediction explainer.
- jalammar.github.io Jay Alammar 2018 Accessed: 2026-05-16
Most widely referenced popular explanation of the Transformer mechanism with Q/K/V and attention matrix visualisations. Industry standard entry point for non-specialists.
- YouTube / 3Blue1Brown 3Blue1Brown 2024 Accessed: 2026-05-16
Animated explanation of tokenisation, embeddings, and the attention mechanism.
- Neural Computation Sepp Hochreiter, Jürgen Schmidhuber 1997 Accessed: 2026-05-16
Canonical LSTM paper. Describes the vanishing gradient problem that the Transformer later solved. Historical background for the 'cassette tape' section.
- transformer-circuits.pub / Anthropic Nelson Elhage, Neel Nanda, Catherine Olsson, Tom Henighan†, Nicholas Joseph, Ben Mann†, Amanda Askell, Yuntao Bai, Anna Chen, Tom Conerly, Nova DasSarma, Dawn Drain, Deep Ganguli, Zac Hatfield-Dodds, Danny Hernandez, Andy Jones, Jackson Kernion, Liane Lovitt, Kamal Ndousse, Dario Amodei, Tom Brown, Jack Clark, Jared Kaplan, Sam McCandlish, Chris Olah 2021 Accessed: 2026-05-16
Anthropic mechanistic interpretability research showing how attention heads specialise emergently. Confirms 'no one programmed these roles'.
- arXiv / Meta AI Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timothée Lacroix, Baptiste Rozière, Naman Goyal, Eric Hambro, Faisal Azhar, Aurelien Rodriguez, Armand Joulin, Edouard Grave, Guillaume Lample 2023 Accessed: 2026-05-16
Original LLaMA paper. Confirms the expansion of the acronym: Large Language Model Meta AI.
376 Topics
8 sources
- OpenAI 2024 Accessed: 2026-05-16
Primary product page. Contains official acknowledgement of failures: 'Sora fails to model the chair as a rigid object' and 'inaccurate physical modeling and unnatural object morphing' (basketball).
- OpenAI 2024 Accessed: 2026-05-16
Technical report. Includes the claim about simulating the physical world and the admission: 'it does not accurately model the physics of many basic interactions, like glass shattering.' Confirms cookies and eating food as known failure modes.
Official Sora 2 launch. Contains the key quote: 'Prior video models are overoptimistic — if a basketball player misses a shot, the ball may spontaneously teleport to the hoop.'
- The Decoder Matthias Bastian 2024 Accessed: 2026-05-16
Coverage of the PhyGenBench study (Liu et al., ByteDance Research / Tsinghua University). Conclusion: 'naively scaling is insufficient for video generation models to discover fundamental physical laws.' Includes Yann LeCun's assessment that generating pixels to model the world is 'wasteful and doomed to failure'."
- Artificial Cognition Raphaël Millière 2024 Accessed: 2026-05-16
Philosophical and cognitive science analysis. Covers the 'intuitive physics engine' concept and why Sora lacks one. Good background for the 'statistics without physics' section.
- Data Literacy Ben Jones 2024 Accessed: 2026-05-16
Practical tests from December 2024. Documents specific physical failures (basketball, gymnast, limb morphing) from original prompts.
- VentureBeat Carl Franzen 2024 Accessed: 2026-05-16
Public launch coverage. Quotes MKBHD: 'unnatural physics, adding or removing objects seemingly at random.' Industry context for the delayed rollout.
- MindStudio 2026 Accessed: 2026-05-16
Comprehensive retrospective. Describes the gap between the February 2024 demo and the December 2024 product, and the reasons for discontinuation.
6 sources
- Microsoft Blog Peter Lee 2016 Accessed: 2026-05-16
Official Microsoft apology and post-mortem. Source for the quote: 'We are deeply sorry for the unintended offensive and hurtful tweets.' Describes the incident as a 'coordinated attack'.
- IEEE Spectrum Oscar Schwartz 2019 Accessed: 2026-05-16
Thorough analytical account of the incident. Describes the 4chan mechanism, the repeat-after-me feature, Zoë Quinn's role, and lessons for the industry.
- Washington Post Alexandra Petri 2016 Accessed: 2026-05-16
Day-of account. Quotes specific tweets and reactions, including Zoë Quinn's response.
- Wikipedia Accessed: 2026-05-16
Well-documented overview. Confirms date (23 March 2016), tweet count (96,000), repeat-after-me mechanism, the note that not all incidents stemmed from that feature, and subsequent replacement by Zo.
- Malicious Life Podcast Ran Levi 2021 Accessed: 2026-05-16
Detailed mechanism analysis. Confirms 4chan and 8chan as coordination sources. Quotes Tay's final tweet: 'c u soon humans need sleep now so many conversations today thx.'
- AI Incident Database Sean McGregor 2016 Accessed: 2026-05-16
Official AI incident database entry. Contains Microsoft's full apology and failure mode classification (Specification, Robustness, Assurance).
6 sources
- NPR Lisa Hagen, Huo Jingnan, Audrey Nguyen 2025 Accessed: 2026-05-16
Primary account. Contains all key facts, quotes, and regulatory responses.
- Grokipedia 2025 Accessed: 2026-05-16
Detailed documentation. Covers internal xAI Slack reactions and the 16-hour incident timeline.
It is worth appreciating that xAI is not concealing the incident and is itself publishing a detailed description of it on its own Grokipedia.
- Wikipedia Accessed: 2026-05-16
Comprehensive documentation of the incident, the May 2025 timeline, Wolfenstein 3D reference, 'every damn time', and country-level reactions.
- Al Jazeera Elizabeth Melimopoulos 2025 Accessed: 2026-05-16
Turkish and Polish context. ADL quote. Verbatim Hitler-related output from the chatbot.
- CBS News / The Conversation Aaron J. Snoswell 2025 Accessed: 2026-05-16
Academic analysis (Queensland University of Technology). Comparison with Tay. Describes the system prompt and fine-tuning mechanism.
- Congressman Suozzi / Jewish Insider Marc Rod 2025 Accessed: 2026-05-16
Official xAI response to the US Congress (Lily Lim). Full technical explanation of the incident. Source for the 'dialed down the woke filters' quote.
5 sources
- The New York Times Kevin Roose 2023 Accessed: 2026-05-16
Primary source. Full description of the two-hour conversation: the love declaration, the attempt to destabilise the reporter's marriage, and 'I want to be alive.' Paywalled but cited by all secondary sources.
- Wikipedia Accessed: 2026-05-16
Comprehensive documentation. Covers blackmail quotes, Avatar 2 gaslighting, AP reaction, Microsoft's restriction timeline, and prompt injection.
- The Guardian Jonathan Yerushalmy 2023 Accessed: 2026-05-16
Verbatim quotes from the transcript. Useful secondary source for direct citations.
- Fortune Jeremy Kahn 2023 Accessed: 2026-05-16
Quotes Microsoft CTO Kevin Scott: the chatbot was 'more likely to turn into Sydney in longer conversations.' Confirms context decay as an officially acknowledged cause.
- GitHub / JD Preston JD Preston Accessed: 2026-05-16
Complete incident chronology with links to primary articles. Covers Kevin Liu's prompt injection (8 February 2023) and Microsoft's restriction timeline (5–10 prompts per session).
5 sources
- Variety Todd Spangler 2024 Accessed: 2026-05-16
Day-of-pause coverage. Lists all erroneous categories: Wehrmacht, Founding Fathers, Black Vikings, female pope, women NHL players.
- NPR Bobby Allyn 2024 Accessed: 2026-05-16
Good explanation of the mechanism: describes the 'secret code' appended to every prompt instructing diversification, the 2015 gorilla scandal as motivation, and a former Google engineer's quote.
- Euronews 2024 Accessed: 2026-05-16
Exact Pichai memo quote: 'completely unacceptable and we got it wrong.' Covers remediation steps and other Gemini errors (Caitlin Jenner).
- Vibe Graveyard 2024 Accessed: 2026-05-16
Source for SVP Prabhakar Raghavan's quote: model 'overcompensated' and was 'over-conservative.' Describes the uniform diversity calibration mechanism.
Special kudos for incident severity categorization: facepalm.
- CBS News Ahmed Shawkat, Tucker Reals 2023 Accessed: 2026-05-16
Quotes the official statement from the Egyptian Ministry of Tourism and Antiquities: 'Queen Cleopatra had light skin and Hellenistic (Greek) features.' Covers lawyer Mahmoud al-Semary's lawsuit demanding Netflix be shut down in Egypt. Historical and academic context. Note: the formal protest came from a private lawyer and the Ministry — not from the Egyptian government in a diplomatic sense.
8 sources
- CourtListener 2024 Accessed: 2026-05-16
Complete court record for No. 6:24-cv-01903 (M.D. Fla.). Contains all filings, responses, and evidence submitted by the parties.
- CourtListener 2025 Accessed: 2026-05-16
Judge Anne Conway's ruling (21 May 2025) rejecting Character.AI's First Amendment argument. Primary legal source for the 'design failure, not speech' thesis.
- CourtListener 2026 Accessed: 2026-05-16
Mediation settlement document. Basis for the statement that the case was resolved without a merits ruling.
- CNN Clare Duffy 2026 Accessed: 2026-05-16
Confirms the 7 January 2026 settlement. Covers the broader wave of lawsuits against Character.AI and the company's new safeguards for minors.
- UNILAD Joe Yates 2025 Accessed: 2026-05-16
Contains the exact transcript of Sewell's final conversation with the bot. Describes Judge Conway's ruling.
- Tech Justice Law 2024 Accessed: 2026-05-16
Page from the law firm representing Megan Garcia. Chronology of the case and key legal arguments. Notes that the system's failure to respond to suicidal ideation was the central allegation.
- TechPolicy.Press 2024 Accessed: 2026-05-16
Tracker with dates of key procedural events.
- US Senate Judiciary Committee Megan Garcia 2025 Accessed: 2026-05-16
Direct testimony before the US Senate (16 September 2025). Describes the addiction mechanism, lack of safeguards, and legislative demands. Strong primary narrative source.
385 Topics
6 sources
- Nature Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, Nicolas Papernot, Ross Anderson, Yarin Gal 2024 Accessed: 2026-05-16
Canonical source for the term and mechanism. Describes early and late model collapse, loss of distribution tails, and degradation from recursive training on synthetic data.
- arXiv Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, Yarin Gal, Nicolas Papernot, Ross Anderson 2023 Accessed: 2026-05-16
Pre-publication version of the Nature paper. The term 'model collapse' was introduced here.
- Wikipedia Accessed: 2026-05-16
Overview with breakdown of early/late collapse, academic controversies (whether data accumulation prevents collapse), and links to literature.
- IBM Think Alice Gomstyn, Alexandra Jonker Accessed: 2026-05-16
Accessible industry explanation of the mechanism. Cites Shumailov and covers real-world implications.
- The Atlantic Kaitlyn Tiffany 2021 Accessed: 2026-05-16
First mainstream analysis of the Dead Internet Theory. Covers its origin on the Agora Road Macintosh Cafe forum (2021) and the 'botification' of the internet.
- arXiv Prathamesh Muzumdar, Sumanth Cheemalapati, Srikanth Reddy RamiReddy, Kuldeep Singh, George Kurian, Apoorva Muley 2025 Accessed: 2026-05-16
Academic survey of the Dead Internet Theory. Covers bots, AI-generated content, and how engagement metrics have displaced authentic human interaction.
6 sources
- Futurism Maggie Harrison Dupré 2023 Accessed: 2026-05-16
Primary investigative report (27 November 2023). Covers Drew Ortiz, AdVon Commerce, AI-generated headshots, and the Arena Group's response.
- Variety Todd Spangler 2023 Accessed: 2026-05-16
Arena Group statement. Details of the AdVon relationship and confirmation that profiles were removed after Futurism contact.
- NPR David Folkenflik 2023 Accessed: 2026-05-16
Day-of coverage. Quotes from the volleyball article and the SI Union's reaction: 'horrified'.
- NPR Emma Bowman 2024 Accessed: 2026-05-16
82 union employees (80% of staff) laid off. Authentic Brands Group terminates licensing deal.
- Front Office Sports A.J. Perez 2024 Accessed: 2026-05-16
First layoff report. Financial details: $3.75M missed payment, $45M penalty.
- Variety Todd Spangler 2024 Accessed: 2026-05-16
SEC filing context: 100+ layoffs = 33% of Arena Group workforce. Restructuring cost $5–7M.
7 sources
- Bulletin of the Atomic Scientists Guillaume Cabanac, Cyril Labbé, Alexander Magazinov 2022 Accessed: 2026-05-16
Canonical source for the term and mechanism. Lists key examples: bosom peril, flag to clamor, counterfeit consciousness, arbitrary timberland.
- arXiv Guillaume Cabanac, Cyril Labbé, Alexander Magazinov 2021 Accessed: 2026-05-16
Original academic paper defining tortured phrases. Detection methodology and scale analysis.
- TechXplore Guillaume Cabanac, Cyril Labbé, Frederik Joelving 2025 Accessed: 2026-05-16
Description of the automated detection tool. Confirms scale: 19,000+ flagged papers.
- Université de Toulouse / IRIT Accessed: 2026-05-16
Interactive tool for searching scientific literature for tortured phrases. The 'fingerprints' section contains hundreds of documented examples.
- Forum Akademickie Jolanta Szczepaniak 2023 Accessed: 2026-05-16
Polish-language overview of the paper mills and tortured phrases phenomenon in an academic context. Confirms 'leftover vitality' as a documented example.
- Nature Chris Stokel-Walker 2023 Accessed: 2026-05-16
Documents cases of ChatGPT output appearing in published papers. Background for the 'Regenerate Response' and 'Certainly, here is a possible introduction' section.
- Times Higher Education Jack Grove 2022 Accessed: 2026-05-16
Broader context on the phenomenon. Quotes Cabanac and describes the detection methodology.
5 sources
- Rolling Stone Ethan Millman 2024 Accessed: 2026-05-16
Primary indictment coverage (September 2024). 661,440 streams/day, 10,000 bot accounts, $10M, Southern District of New York.
- Music Producer Accused of Using AI Songs to Scam Streaming Platforms Out of $10 Million in RoyaltiesVariety Gene Maddaus 2024 Accessed: 2026-05-16
Contains the Smith email quote: 'We need to get a TON of songs fast to make this work around the anti fraud policies.'
- Bleeping Computer Sergiu Gatlan 2026 Accessed: 2026-05-16
Guilty plea. Forfeiture of $8,091,843.64. Maximum 5 years imprisonment. Described as the first criminal case involving artificially inflated music streaming.
- Music Business Worldwide Murray Stassen 2024 Accessed: 2026-05-16
Confirms Spotify's claim that its platform accounted for less than 1% of the $10M.
- Music Business Worldwide Accessed: 2026-05-16
Series of articles from 2017 onwards documenting the ghost artists controversy on mood playlists. Cite as a series, not a single article.
7 sources
- arXiv / ACM CCS 2022 Hammond Pearce, Baleegh Ahmad, Benjamin Tan, Brendan Dolan-Gavitt, Ramesh Karri 2022 Accessed: 2026-05-16
Primary study. 89 scenarios, 1,692 programs, ~40% contained exploitable vulnerabilities (SQL injection, buffer overflows, hardcoded credentials, XSS). NYU Tandon School of Engineering.
- NYU Center for Cybersecurity Lois Anne DeLong 2021 Accessed: 2026-05-16
Official NYU confirmation of the study. Accessible summary.
- The Register Thomas Claburn 2021 Accessed: 2026-05-16
Original trade coverage of the study with findings and examples.
- Medium / Vitalii Petrenko Vitalii Petrenko 2026 Accessed: 2026-05-16
Cites Apiiro data from June 2025: AI-assisted developers produced 10× more security issues than the baseline. Context for the 'sharp increase in vulnerabilities' point.
- arXiv / ACM CCS 2023 Neil Perry, Megha Srivastava, Deepak Kumar, Dan Boneh 2023 Accessed: 2026-05-16
Stanford study confirming that programmers using AI assistants wrote less secure code in 4 of 5 tasks and were simultaneously more confident that their code was secure — a classic automation bias effect.
- X (Twitter) Andrej Karpathy 2025 Accessed: 2026-05-16
Original definition of vibe coding (2 February 2025): 'fully give in to the vibes, embrace exponentials, and forget that the code even exists.'
- Wikipedia Accessed: 2026-05-16
Documents the term's history, Collins Word of the Year 2025, security criticism, and CodeRabbit data: AI code has 2.74× more security vulnerabilities than human-written code.
394 Topics
6 sources
- BuzzFeed News Chris Stokel-Walker 2023 Accessed: 2026-05-16
Interview with Pablo Xavier (31-year-old construction worker from Chicago). The image's origin, the viral reaction, and his motivation.
- Poynter Alex Mahadevan 2023 Accessed: 2026-05-16
Fact-check. Confirms the distorted hand as the AI tell most viewers missed. Chrissy Teigen as a victim. Hank Green poll: nearly half of 240,000 respondents were fooled.
- CBS News Simon Ellery 2023 Accessed: 2026-05-16
Describes the image's distortions (left hand with water bottle, overly sharp skin). Chrissy Teigen quote. Industry context.
- Slate Heather Tal Murphy 2023 Accessed: 2026-05-16
Context on Midjourney V5 as a photorealism breakthrough. Ryan Broderick quote: 'first real mass-level AI misinformation case'.
- IEEE Computer Society Nir Kshetri 2024 Accessed: 2026-05-16
Academic review of AI-generated images' impact on public trust and fact verification. Background for the Normalisation of Distrust section.
- UC Berkley Hany Farid Accessed: 2026-05-16
Leading academic researcher in forensic image analysis and deepfake detection. Canonical reference for the technical background on AI-generated image detection.
6 sources
- Wall Street Journal Catherine Stupp 2019 Accessed: 2026-05-16
Primary source. Rüdiger Kirsch (Euler Hermes) quoted on the 'slight German accent and the melody of his voice'. Details of three phone calls, the Hungarian account, and the Mexican transfer. Paywalled but cited by all secondary sources."
- Sophos 2019 Accessed: 2026-05-16
Detailed technical account. Cites WSJ. Describes the three-call timeline and the verification mechanism. Cybersecurity perspective.
- PaymentsJournal Tim Sloane 2019 Accessed: 2026-05-16
Industry coverage. Confirms all key facts. Financial and insurance context (Euler Hermes as underwriter).
- ICAEW 2025 Accessed: 2026-05-16
Describes this case as a landmark first. Covers the evolution of voice fraud from 2019 to 2025. Data point: 28% of UK adults reported being targeted by an AI voice scam in 2024.
- Note regarding punchlineAuthor's note
Executive astrology is a real thing. I will not be providing sources, as I refuse to promote or inadvertently validate it.
- Note regarding footnoteAuthor's note
The author acknowledges that no formal law obliges Polish citizens to question the melodic qualities of the German (or, for that matter, any other) language. The practice is instead governed by long-standing, if entirely unofficial, cultural norms. Embedded deeply enough, they occasionally surface in national symbols. That, however, is a discussion entirely outside the scope of this work.
7 sources
- Vuink 2023 Accessed: 2026-05-16
Primary account. Describes the 48-hour moratorium, the gap in Meta's policy (audio vs. video), and the AFP fact-check.
- International Press Institute Karin Kőváry Sólymos 2023 Accessed: 2026-05-16
Press freedom perspective. History of attacks on Tódová and analysis of how the deepfake fit into a sustained campaign to discredit the journalist.
- VSquare.org Karin Kőváry Sólymos 2023 Accessed: 2026-05-16
Investigative account. Propagation timeline, coincidence with the SVR statement, and Štefan Harabin's distribution role.
- The Dial Ondřej Kundra 2023 Accessed: 2026-05-16
Most detailed narrative of the incident. Quotes Šimečka ('It does sound like me'), AFP Barca (130,000+ shares on Meta), and the context of 30% undecided voters one week before the deepfake appeared.
- CNN Curt Devine, Donie O'Sullivan, Sean Lyngaas 2024 Accessed: 2026-05-16
Quotes Janis Sarts (NATO Strategic Comms Centre of Excellence) on the SVR statement appearing one hour before the deepfake: 'The claims made in the Russian Intelligence Service's statement and the content of the deepfake that went viral simultaneously correspond to each other.'
- Harvard Kennedy School Misinformation Review Lluis de Nadal, Peter Jančárik 2024 Accessed: 2026-05-16
Academic analysis of the deepfake's impact on the election outcome. Cautions against simple attribution of the result to the deepfake. Context of pro-Kremlin disinformation in Slovakia since 2010.
- AI Incident Database Daniel Atherton 2023 Accessed: 2026-05-16
Complete chronology of all deepfake incidents from the Slovak campaign (Čaputová, Šimečka, beer prices). Documents platform moderation responses.
5 sources
- CNN Business Kathleen Magramo 2024 Accessed: 2026-05-16
Primary account after the company name was made public. Official Arup statement ('fake voices and images were used'). Details of 15 transfers, 5 bank accounts, HK$200M.
- Fortune Prarthana Prakash 2024 Accessed: 2026-05-16
CIO Rob Greig quote: 'technology-enhanced social engineering'. Context of rising corporate deepfake attacks.
- AI Incident Database Daniel Atherton 2024 Accessed: 2026-05-16
Complete incident documentation with links to all media coverage. Notes that this case became shorthand in journalism for deepfake fraud.
- CFO Dive Grace Noto 2024 Accessed: 2026-05-16
Financial and CFO perspective. Describes how deepfakes bypass traditional payment authorisation controls.
- Communications of the ACM Gaurav Belani 2025 Accessed: 2026-05-16
Academic-industry analysis of the mechanism. Describes the evolution from phishing to 'full-on video call with recognizable faces and voices'. Comparison with classical social engineering.
408 Topics
11 sources
- arXiv / Anthropic Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Carol Chen, Catherine Olsson, Christopher Olah, Danny Hernandez, Dawn Drain, Deep Ganguli, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse, Kamile Lukosuite, Liane Lovitt, Michael Sellitto, Nelson Elhage, Nicholas Schiefer, Noemi Mercado, Nova DasSarma, Robert Lasenby, Robin Larson, Sam Ringer, Scott Johnston, Shauna Kravec, Sheer El Showk, Stanislav Fort, Tamera Lanham, Timothy Telleen-Lawton, Tom Conerly, Tom Henighan, Tristan Hume, Samuel R. Bowman, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Jared Kaplan 2022 Accessed: 2026-05-16
Original paper defining Constitutional AI. Describes the self-critique mechanism and RLAIF. 51 Anthropic authors. Also cited in the Automated Priest case study.
- Anthropic 2023 Accessed: 2026-05-16
Official Anthropic page describing Constitutional AI as an approach. Accessible explanation of the mechanism and philosophy. Also cited in the Automated Priest case study.
- arXiv / Anthropic Ethan Perez, Saffron Huang, Francis Song, Trevor Cai, Roman Ring, John Aslanides, Amelia Glaese, Nat McAleese, Geoffrey Irving 2022 Accessed: 2026-05-16
Canonical academic source for AI red teaming. Describes the mechanism and methodology.
- CSA 2026 Accessed: 2026-05-16
Cloud Security Alliance Research Note on NIST's forthcoming AI RMF Playbook for Red Teaming. Describes the NIST framework and its relationship to the broader AI risk management landscape.
- NIST Accessed: 2026-05-16
NIST's AI Risk Management Framework. Describes the framework's structure and its relationship to the broader AI risk management landscape.
- NIST 2026 Accessed: 2026-05-16
NIST's AI Risk Management Framework Playbook
- RAND Corporation Willis H. Ware 1970 Accessed: 2026-05-16
Willis Ware's task force report. Organised 1967, published 1970. Foundation of the history of penetration testing.
- Wikipedia Accessed: 2026-05-16
Complete history of the term from 1965 to the present. Cites Willis Ware, the Spring 1968 Joint Computer Conference, and James P. Anderson.
- Infosec Institute 2019 Accessed: 2026-05-16
Accessible historical narrative. Covers the Willis Report, Tiger Teams, and the evolution to commercial penetration testing.
- Wikipedia Accessed: 2026-05-16
Footnote reference. Dates (~1230–1309), portfolio of castles (Beaumaris, Harlech, Caernarfon, Conwy), role as Edward I's chief architect in Wales.
- Cadw (Welsh Government) Accessed: 2026-05-16
Footnote reference. Official Welsh heritage source. Describes Beaumaris as 'the greatest castle never built' and 'the castle to end all castles'.
7 sources
- arXiv / NeurIPS 2022 Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe 2022 Accessed: 2026-05-16
Canonical source for RLHF as a method. The InstructGPT paper. Describes the reward model mechanism and human labellers.
- arXiv / Anthropic Yuntao Bai, Andy Jones, Kamal Ndousse, Amanda Askell, Anna Chen, Nova DasSarma, Dawn Drain, Stanislav Fort, Deep Ganguli, Tom Henighan, Nicholas Joseph, Saurav Kadavath, Jackson Kernion, Tom Conerly, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Tristan Hume, Scott Johnston, Shauna Kravec, Liane Lovitt, Neel Nanda, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam McCandlish, Chris Olah, Ben Mann, Jared Kaplan 2022 Accessed: 2026-05-16
Anthropic's original RLHF study. First description of sycophancy as an undesired side effect.
- ICLR 2024 / OpenReview Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R. Bowman, Esin DURMUS, Zac Hatfield-Dodds, Scott R Johnston, Shauna M Kravec, Timothy Maxwell, Sam McCandlish, Kamal Ndousse, Oliver Rausch, Nicholas Schiefer, Da Yan, Miranda Zhang, Ethan Perez 2024 Accessed: 2026-05-16
Canonical sycophancy study. 'Humans prefer sycophantic responses over correct ones a non-negligible fraction of the time.' Primary academic source for the entire sycophancy section.
- arXiv Itai Shapira, Gerdus Benade, Ariel D. Procaccia 2025 Accessed: 2026-05-16
Mechanistic explanation of why RLHF amplifies sycophancy. Formal mathematical model.
- arXiv / Stanford Ryan Park, Rafael Rafailov, Stefano Ermon, Chelsea Finn 2024 Accessed: 2026-05-16
Canonical source for verbosity bias. Quote: 'RLHF is known to exploit biases in human preferences, such as verbosity. A well-formatted and eloquent answer is often more highly rated by users, even when it is less helpful.'
- arXiv Keita Saito, Akifumi Wachi, Koki Wataoka, Youhei Akimoto 2023 Accessed: 2026-05-16
Empirical study of verbosity bias in both human labellers and LLM-as-evaluator settings.
- BlueDot Impact 2024 Accessed: 2026-05-16
Accessible analysis of the main RLHF problems: sycophancy, over-refusal, deceptive alignment. Also cited in the Automated Priest case study.
5 sources
- OpenAI 2023 Accessed: 2026-05-16
98-page document. 50+ domain experts. ARC's TaskRabbit/CAPTCHA incident. Multimodal jailbreaks. Primary source for all facts in this case study.
- Vice / Motherboard Joseph Cox 2023 Accessed: 2026-05-16
Original media account of the TaskRabbit incident. Quotes the System Card directly.
- arXiv / OpenAI Hurst et al. 2023 Accessed: 2026-05-16
Documents multimodal jailbreaks. 'Text-screenshot jailbreak' as a key problem. Background for the t-shirt and LEGO bricks section.
ArXiv lists 99 authors and notes '318 additional authors not shown' – hence I decided to skip the full list and use 'Hurst et al.' as the author.
- METR Accessed: 2026-05-16
Organisation that conducted the TaskRabbit test cited in the GPT-4 System Card. Mission: 'align future ML systems with human interests'.
Original link pointed to https://evals.alignment.org but it now redirects to https://metr.org. Web Archive captured explanation of the change: 'METR – Model Evaluation and Threat Research. Formerly “ARC Evals”, METR was incubated at the Alignment Research Center and is now a standalone non-profit.'
- Adversa AI 2023 Accessed: 2026-05-16
Describes jailbreak techniques with reference to first reports appearing within two hours of model publication.
5 sources
- arXiv / Anthropic Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Carol Chen, Catherine Olsson, Christopher Olah, Danny Hernandez, Dawn Drain, Deep Ganguli, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse, Kamile Lukosuite, Liane Lovitt, Michael Sellitto, Nelson Elhage, Nicholas Schiefer, Noemi Mercado, Nova DasSarma, Robert Lasenby, Robin Larson, Sam Ringer, Scott Johnston, Shauna Kravec, Sheer El Showk, Stanislav Fort, Tamera Lanham, Timothy Telleen-Lawton, Tom Conerly, Tom Henighan, Tristan Hume, Samuel R. Bowman, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Jared Kaplan 2022 Accessed: 2026-05-16
Original academic paper. Contains the RLAIF mechanism and self-critique description. Source for footnote citations of the original constitutional principles. Also cited in the Illusion of the Digital Leash explainer.
- Anthropic 2023 Accessed: 2026-05-16
Official documentation. Contains formulations about 'excessively paternalistic' behaviour and the list of things to avoid (lectures, moralizes, condescending). Source for the evolution from Claude 2.x to the current approach. Also cited in the Illusion of the Digital Leash explainer.
- Reddit r/ClaudeAI 2023 Accessed: 2026-05-16
Documented comparison of Claude 2.0 and 2.1 with over-refusal examples. Community evidence for the 'legendary' status of the 2.x models on forums.
- Reddit r/LocalLLaMA 2023 Accessed: 2026-05-16
Specific documented case of Claude 2.1 refusing the `kill` command for a Python process. Primary source for the kill command anecdote.
- Hacker News 2023 Accessed: 2026-05-16
Day-of-launch discussion. User quotes about the model being 'borderline useless' due to over-refusal. Context for the Reddit and HackerNews reaction described in the text.
6 sources
- Gizmodo Kyle Barr 2022 Accessed: 2026-05-16
Primary account. Challenger disaster, threats against users, and overturning the Biden administration — all examples produced by the compromised bot.
- The Register Brandon Vigliarolo 2022 Accessed: 2026-05-16
Quotes Simon Willison on the impossibility of effective defences against prompt injection. Reproduces Remoteli's system prompt: 'Respond to tweets about remote work with positive comments.'
- AI Incident Database Khoa Lam 2022 Accessed: 2026-05-16
Official incident documentation. GPT-3, Twitter, September 2022.
- X (Twitter) Riley Goodside 2022 Accessed: 2026-05-16
First documented discovery of the technique, four days before the Remoteli attack (12 September 2022).
- simonwillison.net Simon Willison 2022 Accessed: 2026-05-16
First use of the term 'prompt injection'. SQL injection analogy. Explains why there is no simple solution.
- IBM Think Matthew Kosinski 2024 Accessed: 2026-05-16
Technical analysis of the mechanism. Uses Remoteli as a case study. Explains why parameterisation (effective against SQL injection) does not work for LLMs.
4 sources
- GM Authority Jonathan Lopez 2023 Accessed: 2026-05-16
Primary industry account from the day of the incident. Exact quotes from Bakke and the bot's responses. GM comment.
- VentureBeat Bryson Masse 2023 Accessed: 2026-05-16
Confirms MSRP of $58,195. Chris White as first tester. Industry context.
- AI Incident Database Daniel Atherton 2023 Accessed: 2026-05-16
Incident documentation. Identifies Fullpath as the chatbot provider. 20 million views.
- X (Twitter) Chris Bakke 2023 Accessed: 2026-05-16
Original viral post. Screenshot of the conversation with the bot.
5 sources
- CNBC Rohan Goswami 2023 Accessed: 2026-05-16
Canonical source. Exact quotes from SessionGloomy and the token mechanism. CNBC's own tests.
- Know Your Meme Aidan Walker 2023 Accessed: 2026-05-16
Complete history of DAN from u/Seabout (December 2022) through SessionGloomy (DAN 5.0, February 2023). Documents the evolution.
- GitHub Gist AJ ONeal Accessed: 2026-05-16
Archive of DAN prompts 2.0–13.0. Documents the evolution of the split-personality framing and the token system.
- Abnormal AI Accessed: 2026-05-16
Accessible definition and mechanism explanation for a general audience.
- AdGuard Ekaterina Kachalova 2023 Accessed: 2026-05-16
Security perspective on the mechanism. Covers the evolution of prompts and the context of misuse.
8 sources
- pshapira.net Philip Shapira 2024 Accessed: 2026-05-16
Original analysis on OpenAlex. 46% of papers containing 'delve' from 1990 published in the 15 months after ChatGPT. Canonical source for Shapira's data.
- arXiv Dmitry Kobak, Rita González-Márquez, Emőke-Ágnes Horvát, Jan Lause 2024 Accessed: 2026-05-16
14 million PubMed abstracts. At least 10–13.5% of 2024 abstracts processed by LLMs. Includes the flag-word list.
- arXiv Tom S. Juzek, Zina B. Ward 2024 Accessed: 2026-05-16
21 flag words identified by formal method. Examines RLHF's role in lexical overrepresentation. Directly addresses the mechanism described in the text.
- PubMed Central Kentaro Matsui 2025 Accessed: 2026-05-16
Peer-reviewed study on AI-associated words in scientific literature. 85-fold increase for 'delve' and 'underscore' combined.
- Technollama Andres Guadamuz 2023 Accessed: 2026-05-16
Legal and cultural analysis of 'delve' as a dialectal marker. Colonial context and the linguistic chauvinism question.
- Time Billy Perrigo 2023 Accessed: 2026-05-16
Canonical investigation into RLHF outsourcing to Kenya (Sama, Nairobi). Less than $2 per hour.
- Stowarzyszenie Pravda Maria Święcicka 2023 Accessed: 2026-05-16
Polish article describing the investigation into Sama's Kenyan workers.
Stowarzyszenie Pravda is a Polish non-profit organization focused on fighting misinformation and promoting media literacy. Its name roughly translates to 'Truth Association' in English (the actual word for 'truth' is spelled 'prawda').
- Scientific American Vanessa Bates Ramirez 2025 Accessed: 2026-05-16
Documents the co-evolution: people began using LLM-borrowed vocabulary in everyday communication, creating the feedback loop described in the text.
##1 Topic
19 sources
- Artificial General IntelligenceSpringer Ben Goertzel, Cassio Pennachin (eds.) 2007
ISBN: 978-3-540-23733-4.
Canonical academic introduction of the AGI term. Goertzel and Legg popularised the term around 2002. Defines AGI as a system capable of performing any intellectual task at human level.
- arXiv Dan Hendrycks, Dawn Song, Christian Szegedy, Honglak Lee, Yarin Gal, Erik Brynjolfsson, Sharon Li, Andy Zou, Lionel Levine, Bo Han, Jie Fu, Ziwei Liu, Jinwoo Shin, Kimin Lee, Mantas Mazeika, Long Phan, George Ingebretsen, Adam Khoja, Cihang Xie, Olawale Salaudeen, Matthias Hein, Kevin Zhao, Alexander Pan, David Duvenaud, Bo Li, Steve Omohundro, Gabriel Alfour, Max Tegmark, Kevin McGrew, Gary Marcus, Jaan Tallinn, Eric Schmidt, Yoshua Bengio 2025 Accessed: 2026-05-16
Formal definition based on Cattell-Horn-Carroll theory across 10 cognitive domains. Documents the 'jagged cognitive profile' of current models — good academic background for the list of missing capabilities (reasoning, planning, on-the-fly learning).
- IBM Think Accessed: 2026-05-16
Solid industry definition. Quotes LeCun on the need for a new architecture. Explains why current LLMs (GPT-4), despite apparent versatility, remain ANI.
- Google Cloud 2026 Accessed: 2026-05-16
Accessible ANI/AGI/ASI definition. Quick reference for non-specialist readers.
- Wikipedia Accessed: 2026-05-16
Solid overview with the history of the term (Gubrud 1997, AIXI 2000, Legg/Goertzel 2002). ANI/AGI/ASI distinction. Lists 72 active AGI projects in 37 countries (2020).
- arXiv Katja Grace, Harlan Stewart, Julia Fabienne Sandkühler, Stephen Thomas, Ben Weinstein-Raun, Jan Brauner, Richard C. Korzekwa 2024 Accessed: 2026-05-16
Largest survey of AI researchers (2,778 respondents). Median estimate for 'high-level machine intelligence' shortened by 13 years between 2022 and 2023. Canonical source for the AGI timeline section.
- Effective Altruism Forum Vishakha Agrawal 2025 Accessed: 2026-05-16
Source for Yoshua Bengio's quote: '95% confidence interval for the time horizon of superhuman intelligence at 5 to 20 years' (2023).
- 80,000 Hours Benjamin Todd 2025 Accessed: 2026-05-16
Accessible survey of forecasts. Context for the '5 to 20 years' claim.
- Journal of Comparative Neurology Frederico A.C. Azevedo, Ludmila R.B. Carvalho, Lea T. Grinberg, José Marcelo Farfel, Renata E.L. Ferretti, Renata E.P. Leite, Wilson Jacob Filho, Roberto Lent, Suzana Herculano-Houzel 2009 Accessed: 2026-05-16
Canonical source for the 86 billion neuron figure. Corrects the widely repeated myth of 100 billion.
Source for the 100–500 trillion synapses range as standard approximation. Links to primary neuroscience sources.
- Scientific American Ferris Jabr 2012 Accessed: 2026-05-16
Confirms ~20W brain energy consumption and ~20% of total metabolism.
- IEA (International Energy Agency) 2025 Accessed: 2026-05-16
Canonical IEA report. Global data centres: 415 TWh in 2024, projected 945 TWh by 2030. GPT-4 training: ~50 GWh one-time energy cost.
- Congressional Research Service 2026 Accessed: 2026-05-16
Official US Congress report. 8 GPUs for 8 hours = 7.92 kW median power draw during training. Source for specific kW figures.
- PNAS Marcus E. Raichle and Debra A. Gusnard 2002 Accessed: 2026-05-16
Canonical neuroscience source for the ~20W brain energy figure and 20% of total metabolism.
- TechCrunch Kyle Wiggers 2025 Accessed: 2026-05-16
Official figures: 288B active parameters, nearly 2T total, 16 experts, MoE architecture. Confirms Behemoth was in training at the time of announcement.
- Computerworld 2025 Accessed: 2026-05-16
Documents the release delay. Context for the statement about the unreleased model.
- arXiv / Google Brain Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean 2017 Accessed: 2026-05-16
Original paper introducing MoE to neural networks. Describes the mechanism of activating only a subset of parameters per query. Canonical academic source for Mixture-of-Experts.
- arXiv / Google William Fedus, Barret Zoph, Noam Shazeer 2021 Accessed: 2026-05-16
First practical implementation of MoE in a language model at trillion-parameter scale. Foundation for LLaMA 4 Behemoth and similar architectures.
- arXiv Albert Gu, Tri Dao 2023 Accessed: 2026-05-16
Canonical source for State-Space Models as a potential transformer successor. Mamba as the leading candidate architecture.
5 Chapters
##1 Topic
10 sources
- Psychological Review George A. Miller 1956 Accessed: 2026-05-16
Original paper. Canonical source for the ~7 items in working memory. One of the most cited papers in the history of psychology.
- PubMed Central Nelson Cowan 2015 Accessed: 2026-05-16
Retrospective and critical analysis. More recent research suggests ~4 chunks as a more accurate limit. Background for the caveat that 'modern psychologists debated the exact number'.
- Thinking, Fast and SlowFarrar, Straus and Giroux Daniel Kahneman 2011
ISBN: 978-0-374-27563-1.
Canonical source for availability heuristic, anchoring, and System 1/2 thinking. The standard popular reference for the heuristics section.
- Science Amos Tversky, Daniel Kahneman 1974 Accessed: 2026-05-16
Original academic paper defining availability heuristic and anchoring.
- Human ErrorCambridge University Press James Reason 1990
ISBN: 978-0-521-31419-0.
Original work introducing the Swiss Cheese Model. Canonical source for the biomechanics of error section.
- BMJ James Reason 2000 Accessed: 2026-05-16
Accessible version of the Swiss Cheese Model for a medical audience. Describes it in the context of safety systems.
- The Challenger Launch DecisionUniversity of Chicago Press Diane Vaughan 1996
ISBN: 978-0-226-85175-4.
Original academic work introducing the term Normalisation of Deviance. Analysis of the Challenger disaster as a case study.
- The Field Guide to Understanding 'Human Error'Taylor & Francis Ltd Sidney Dekker 2006
ISBN: 978-0-7546-4825-8.
Accessible industry book on human error in complex systems. Covers normalisation of deviance and the Swiss Cheese Model in an engineering context.
- Journal of Personality and Social Psychology Baumeister, R. F., Bratslavsky, E., Muraven, M., Tice, D. M. 1998 Accessed: 2026-05-16
Original paper on ego depletion as the mechanism behind decision fatigue. Canonical academic source for the mechanism (without the specific 200 decisions figure).
- PNAS Shai Danziger, Jonathan Levav, Liora Avnaim-Pesso 2011 Accessed: 2026-05-16
The best-known empirical study of decision fatigue. Judges granted parole in ~65% of cases at the start of a session, dropping to ~0% just before a break. An ideal illustrative example.
416 Topics
2 sources
- HuffPost Bianca Bosker 2012 Accessed: 2026-05-16
Mayer (then VP at Google) described in a Q&A at 92nd Street Y in New York how Sergey Brin built the simplest possible page because he 'didn't have a webmaster and I don't do HTML.' Brin: 'We just kind of stumbled into it'.
Note: the text suggests both Page and Brin didn't know HTML — Mayer attributes this mainly to Brin. Minor inaccuracy, but it does not change the substance of the story.
- arnabocean.com (archive) Arnab Gupta 2012 Accessed: 2026-05-16
Shorter summary with quotes from Brin extracted
10 sources
- Unicode Consortium Accessed: 2026-05-16
Canonical repository of locale data. Contains number formatting, decimal separators, and thousands separators for all languages and countries.
- ISO 2022 Accessed: 2026-05-16
ISO standard governing the notation of numbers. Recommends a space as the thousands separator and a comma or period as the decimal separator depending on language.
- Wikipedia Accessed: 2026-05-16
Comprehensive table of decimal and thousands separators by country. Confirms the Swiss/Liechtenstein apostrophe notation, the Arabic momayyez, and the Indian lakh system.
- eBay Accessed: 2026-05-16
Official eBay policy. Confirms that bids are a 'binding contract'. Example given verbatim: 'entering £99.50 instead of £9.95'. Canonical source for the legally binding claim.
- eBay Community 2021 Accessed: 2026-05-16
Thread with dozens of seller anecdotes about decimal errors (silverware at $39.99 instead of $3,999.99, token at $9.90 instead of $99). Illustrates how common the phenomenon is.
- Lawyers.com / Ask-a-Lawyer Bruce Robins 2014 Accessed: 2026-05-16
Cites AutoTrader's terms: 'you must be prepared to sell that vehicle at the price at which you've listed it'. Also describes the 'obvious mistake' doctrine as an exception. Good for the legal dimension section.
Bruce Robins is an author of the best answer.
- Centrum Sprzedawcy 2013 Accessed: 2026-05-16
Article in Polish
Thorough legal analysis for Polish e-commerce. Explains that with 'Buy Now' the contract is formed automatically. A seller can invoke error under Art. 84 KC only if the error was material and the buyer could have noticed it easily.
- Subiektywnie o Finansach Ireneusz Sudak 2019 Accessed: 2026-05-16
Article in Polish
Concrete Polish case: a woodworking machine listed at 7,749 zł instead of 77,490 zł. The buyer refused to accept a price correction. Went to court.
- IT-Recht Kanzlei Daniel S. Huber 2015 Accessed: 2026-05-16
Article in German
Comprehensive analysis of § 119 BGB (Irrtumsanfechtung - Contestation on the Grounds of Error) in the context of online pricing errors. Cites OLG Frankfurt 2024 and BGH 2005. Explains that a seller can rescind the contract but must act immediately and pay Vertrauensschaden (Reliance damage).
- Landsberg Recht 2023 Accessed: 2026-05-16
Article in German
Practical analysis with a court example (LG Köln). Covers the requirements for a valid Anfechtung: immediacy, stated reason, documentation.
4 sources
- NBC News / Associated Press 2005 Accessed: 2026-05-16
Day-after account (9 December 2005). $225M loss, 41× the shares in existence, three cancellation attempts, 1.95% Nikkei drop, Mizuho spokesperson quote.
- Finextra 2006 Accessed: 2026-05-16
TSE's official admission that a bug prevented cancellation. Total accounting loss: ¥40.7 billion. Lists resignations (Tsurushima, Yoshino, Amano).
- Finextra 2009 Accessed: 2026-05-16
Court ruling: TSE must pay Mizuho ¥10.7 billion in damages for the system bug.
- Investopedia Anna Attkisson 2026 Accessed: 2026-05-16
Industry definition and context. J-Com as the canonical example. Other fat-finger cases for broader context.
6 sources
- Reuters / Investing.com 2021 Accessed: 2026-05-16
Primary account of Judge Furman's ruling. $893M, $7.8M intent, Brigade/HPS/Symphony, discharge-for-value 'to the penny'.
- National Law Review Christopher J. Dickson , Steven M. Herman, Blake C. Woodward 2021 Accessed: 2026-05-16
Cites court documents directly. Describes the six-eye approval procedure (maker/checker/approver = three people), the Flexcube mechanism, and the wash account. Best technical source for the error mechanism.
- Loeb & Loeb Anthony Pirraglia, Peter G. Seiden 2022 Accessed: 2026-05-16
Canonical source for the Second Circuit reversal. Inquiry notice vs. constructive notice. $500M return ordered.
- Seward & Kissel John R. Ashmead, Gregg S. Bateman, Keith J. Billotti, James C. Cofer, Michael G. Considine, Robert J. Gayda, Meir R. Grossman, Edward S. Horton, Mark D. Kotwick, Robert M. Kurucza, Kevin Neubauer, Anthony Tu-Sekine, Jack Yoskowitz, Y. Daphne Coelho-Adam, Kimberly Giampietro, Thomas Ross Hooper, Sagar Patel, Robert E. Wood, Dale C. Christensen Jr., Paul T. Clark, Ronald L. Cohen, Craig T. Hickernell, Robert A. Walder 2022 Accessed: 2026-05-16
Legal analysis of the Second Circuit ruling. Technical details of the reversal and context for the 'Revlon clawback provisions' the industry adopted after this case.
- Law Point Florida 2021 Accessed: 2026-05-16
Accessible analysis. Quotes the Furman ruling. Context for the UI as a source of the problem section.
- Davies Ward Phillips & Vineberg John McCamus, Michael Disney 2021 Accessed: 2026-05-16
Canadian law perspective. Good comparative analysis of the discharge-for-value mechanism.
6 sources
- TechRadar Alex Hamilton 2014 Accessed: 2026-05-16
Net Applications data, December 2013 (15 months post-launch): Win8 6.89%, Win8.1 3.60%, XP 28.98%.
- PCWorld Daniel Ionescu 2013 Accessed: 2026-05-16
StatCounter and Net Applications comparison. Win8 adoption rate 3× slower than Win7.
- Statista Mathias Brandt 2014 Accessed: 2026-05-16
Data visualisation. Confirms Win8 reached 10% after 15 months vs. 6 months for Win7.
- The Verge Tom Warren 2012 Accessed: 2026-05-16
Canonical launch-day review. Describes the Metro interface, Charms Bar, active corners, and the schizophrenic dual-world experience.
- Ars Technica 2012 Accessed: 2026-05-16
Most thorough technical review of Windows 8. Covers UI design decisions and their consequences for desktop users.
- The Guardian Jack Schofield 2015 Accessed: 2026-05-16
Context for the forced retreat. Windows 10 restored the Start menu and traditional desktop as the default mode.
6 sources
- FCC 2018 Accessed: 2026-05-16
Primary regulatory source. Error mechanism ('This is not a drill' used during an exercise), absence of retraction procedures, and absence of dual verification before the incident.
- Congressional Research Service 2018 Accessed: 2026-05-16
Congress investigation summary. Provides a concise timeline and confirms the absence of an automatic correction mechanism.
Timeline: 8:07 alert sent, 8:13 cancellation attempted, 8:20 Facebook/Twitter correction, 8:45 official retraction. No automatic correction mechanism existed.
- TrueMatter Dean Schuster Accessed: 2026-05-16
Interface screenshot and analysis. Covers 'DRILL - PACOM (CDW)' vs 'PACOM (CDW)' menu items. Before-and-after images of the corrected interface.
- Medium Ben Manley 2024 Accessed: 2026-05-16
Detailed reconstruction from the FCC report. Clarifies that the error was not a misclick but resulted from drill procedures. Context for the corrected error mechanism description.
- NPR Sasha Ingber 2018 Accessed: 2026-05-16
FCC recommendation to remove the phrase from exercises. Employee quote: '100% certain it was real'.
- Wikipedia Accessed: 2026-05-16
Complete documentation. Timeline, resignations (Miyagi, Clairmont), and the employee's history (10 years of performance issues, dismissed 26 January).
424 Topics
10 sources
- GAO (Government Accountability Office) 2014 Accessed: 2026-05-16
Primary government source. FFM obligations grew from $56M to $209M. Readiness assessment delayed from March to September 2013. Launch proceeded without verification of performance requirements.
- GAO (Government Accountability Office) 2015 Accessed: 2026-05-16
Follow-up audit. Ineffective project oversight, unreliable schedule, testing weaknesses. Seven recommendations for CMS.
- HHS Office of Inspector General 2016 Accessed: 2026-05-16
OIG podcast with report author Ruth Ann Dorrill. Ten lessons from the project. Absence of a central project leader throughout the entire duration identified as the primary cause of failure.
- NPR Julie Rovner 2013 Accessed: 2026-05-16
Confirms six registrations from House Oversight Committee internal documents. Political and media context.
- GovCIO Media Katherine MacPhail 2022 Accessed: 2026-05-16
Confirms six enrollments. Describes the 'tech surge' and the cultural shift at CMS.
- HackerNoon Bishr Tabbaa 2018 Accessed: 2026-05-16
Solid single-text summary of the entire failure. Covers the account wall as the bottleneck, the absence of load testing (1,100 concurrent users), costs, garbled insurer data, and the tech surge. Includes its own bibliography.
- Wikipedia Accessed: 2026-05-16
Complete documentation. Costs ($500M pre-launch, $1.7B final), stress test (1,100 users), account wall confirmed as bottleneck by White House officials.
- Time Kate Pickert 2014 Accessed: 2026-05-16
GAO testimony. $840M obligated as of March 2014. Context for the $500M figure as a pre-launch estimate.
- comScore Susan Engleson 2013 Accessed: 2026-05-16
Source for the 2.5 million figure. comScore measured unique visitors, not page views. Source for the conservative value used in the text.
- Level Up / GitConnected 2026 Accessed: 2026-05-16
Reports 4.7 million visits on day one using a different measurement methodology.
6 sources
- Computer Weekly Karl Flinders 2026 Accessed: 2026-05-16
Computer Weekly has investigated this story since 2008. Complete chronology from 1999 to 2024. Quotes the call centre instruction: 'You are the only one experiencing this problem'.
- Computer Weekly Karl Flinders 2025 Accessed: 2026-05-16
Detailed analysis of Fujitsu's role. Knowledge of bugs from 1999. Expert testimony in courts. 13 suicides linked to the scandal.
- Criminal Cases Review Commission Accessed: 2026-05-16
CCRC: 'biggest single series of wrongful convictions in UK legal history'. Covers the case review mechanism and legal basis for overturning convictions (Hamilton & Others v Post Office, 2021).
- Wikipedia Accessed: 2026-05-16
Complete documentation. 900+ prosecutions, 236 imprisoned, 13 suicides, compensation exceeding £1 billion, May 2024 Act overturning convictions en masse.
- Computerworld John E. Dunn 2025 Accessed: 2026-05-16
Coverage of Volume 1 of Sir Wyn Williams's report (July 2025). 13 suicides, 59 people considered suicide. Report quote: 'throughout the lifetime of Legacy Horizon, the Post Office maintained the fiction that its data was always accurate'.
- The Postal Museum Accessed: 2026-05-16
Institutional source and Legacy Project inquiry partner. Confirms the call centre instruction, Alan Bates's role, and the Justice for Subpostmasters Alliance.
7 sources
- Policy Circle Sagari Gupta 2025 Accessed: 2026-05-16
312 million monthly attempts. 20.3 million failures (6.5% failure rate). Clustering among manual labourers and the elderly. Figures sourced from UIDAI's own parliamentary data.
- Biometric Update Lu-Hai Liang 2025 Accessed: 2026-05-16
Parliamentary Public Accounts Committee confirms high error rates. Worn fingerprints and degraded iris patterns. PDS and MGNREGS as the primary areas of exclusion.
- India Stack Watch Accessed: 2026-05-16
Detailed documentation of the human cost including error rates and documented cases. Confirms the death of Santoshi Kumari (11 years old, Jharkhand, 2017) due to exclusion from the PDS system.
- ACM Interactions Subhashish Panigrahi 2022 Accessed: 2026-05-16
Academic analysis of the exclusion mechanism for Dalits, Adivasi, and migrant labourers. Biometric failure modes.
- The India Forum John Simte 2025 Accessed: 2026-05-16
Confirms the death of Santoshi Kumari (11 years old, Jharkhand, 2017). Legal analysis of the Supreme Court judgment of April 2025 (Pragya Prasun v. Union of India). Right to inclusive digital access.
- Nickled and Dimed 2025 Accessed: 2026-05-16
Jharkhand analysis. UIDAI Biometrics Standards Committee 2009 warned of worn fingerprint problems before deployment. Documents the three-stage failure in the PDS system.
- Pathways for Prosperity / Oxford BSG Prakhar Misra, Meena Bhandari Accessed: 2026-05-16
Oxford analysis. 'Perverse twist': the system failed to help the most vulnerable it was designed to protect. Context for the Last Mile section.
9 sources
- Rijksoverheid.nl Accessed: 2026-05-16
Article in Dutch
Official Dutch government page. Confirms the Rutte III resignation (15 January 2021), the apology, the formal acknowledgement of 'institutioneel racisme' (May 2022), and the ongoing compensation operation.
- Tweede Kamer 2020 Accessed: 2026-05-16
Article in Dutch
Official Van Dam parliamentary commission report (17 December 2020). Title: 'Ongekend onrecht' (Unprecedented injustice). Source for the quote: 'grondbeginselen van de rechtsstaat zijn geschonden' (fundamental principles of the rule of law were violated)."
- Tweede Kamer Accessed: 2026-05-16
Article in Dutch
Parliamentary commission page. Hearing schedule, composition, and documentation.
- AI Incident Database Sean McGregor, Khoa Lam 2018 Accessed: 2026-05-16
Official incident documentation. 'Fraud detection model described as a self-learning black box algorithm'. Dual nationality as a risk factor.
- TaxAdmin.AI Accessed: 2026-05-16
Legal and technical analysis. Algorithm mechanism (Dutch/non-Dutch as a binary risk indicator). References World Tax Journal (Hadwick & Lan, 2021).
- Wikipedia Accessed: 2026-05-16
Complete documentation. 26,000 parents (2005–2019), resignation 15 January 2021, Van Dam Commission, compensation for 33,000+ families, €2.75M GDPR fine.
- Amnesty International 2021 Accessed: 2026-05-16
Canonical source for the proxy discrimination section. Self-learning mechanism. 'Black box resulted in a black hole of accountability'.
- Social Policy and Administration (SAGE) Josien Arts, Marguerite van den Berg 2025 Accessed: 2026-05-16
Academic analysis of colonial legacy. Suriname and Caribbean Dutch as the primary affected groups. 'Institutional racism' formally acknowledged by the Dutch state.
- Museum of Failure 2021 Accessed: 2026-05-16
Touring exhibition by Dr. Samuel West. Footnote reference. Source for the figure of 3,532 children removed from their homes.
437 Topics
6 sources
- Automatica Lisanne Bainbridge 1983 Accessed: 2026-05-16
Original paper. Canonical source for the entire explainer. Vol. 19, Issue 6, pp. 775–779.
Full text PDF available: https://ckrybus.com/static/papers/Bainbridge_1983_Automatica.pdf
- Wikipedia Accessed: 2026-05-16
Solid overview with the history of the paper's influence and its key arguments.
- PNAS Eleanor A. Maguire, David G. Gadian, Ingrid S. Johnsrude, Christopher D. Frith 2000 Accessed: 2026-05-16
Original paper. Posterior hippocampus larger in taxi drivers than controls, correlating with years of experience.
- UCL News Eleanor A. Maguire, Katherine Woollett, Hugo J. Spiers 2006 Accessed: 2026-05-16
2011 follow-up study. Confirms the mechanism. Comparison with bus drivers (fixed routes = no effect).
- Scientific Reports Louisa Dahmani, Véronique D. Bohbot 2020 Accessed: 2026-05-16
McGill University paper. Correlation between GPS use and poorer spatial memory and reduced hippocampal activation. Describes the mechanism discussed in the text.
- Scientific American Thomas A. Herring 1996 Accessed: 2026-05-16
Classic popular-science piece on GPS and the atrophy of spatial orientation skills. Background for the 'atrophying muscle' narrative.
5 sources
- AI and Ethics Luciano Cavalcante Siebert, Maria Luce Lupetti, Evgeni Aizenberg, Niek Beckers, Arkady Zgonnikov, Herman Veluwenkamp, David Abbink, Elisa Giaccardi, Geert-Jan Houben, Catholijn M. Jonker, Jeroen van den Hoven, Deborah Forster & Reginald L. Lagendijk 2022 Accessed: 2026-05-16
Academic source for the Information/Time/Authority tripartition as conditions for meaningful human control. Formalises the intuition described in the text.
- Ethics and Information Technology Andreas Matthias 2004 Accessed: 2026-05-16
Canonical source for the 'responsibility gap' concept: when a human is in the loop only nominally, moral and legal responsibility becomes unassignable. Background for the 'liability sponge' framing.
- AI Now Institute 2018 Accessed: 2026-05-16
Describes the structural conditions under which HITL becomes an illusion. Context for the Authority section.
- YouTube / Don McMillan Don McMillan 2022 Accessed: 2026-05-16
For those who have made it this far in the bibliography — it had to be here.
This is not the original source of this sketch, but I couldn't find the right one.
- YouTube Don McMillan Accessed: 2026-05-16
Bonus. The author warmly recommends.
I couldn't link the original clip above so I am including the entire channel.
7 sources
- Michigan Attorney General 2022 Accessed: 2026-05-16
Official government press release. $20M settlement, Bauserman v. UIA, litigation history (2015–2024).
- Benefits Tech Advocacy Hub Accessed: 2026-05-16
Legal timeline. Zynda v. Zimmer (2017), Cahoo v. SAS Analytics, Michigan Supreme Court 2019 and 2022, final resolution 2024.
- AI Incident Database Ed White 2022 Accessed: 2026-05-16
- Bridge Michigan Ted Roelofs 2017 Accessed: 2026-05-16
93% error rate, $47M, the 'income spreading' mechanism, comparison to the Flint water crisis.
- IEEE Spectrum Robert N. Charette 2018 Accessed: 2026-05-16
Technical review. 40,195 cases algorithmically determined. 85% error rate without human intervention. 400% penalty.
- Wisconsin Law Review Rachael Kohl 2024 Accessed: 2026-05-16
Academic legal analysis. 93% error rate, 400% penalty as the highest in the country, the 'Dark Port' mechanism, retroactive accusations going back to 2007.
- The Register Thomas Claburn 2017 Accessed: 2026-05-16
Quotes Tony Paris (Sugar Law Center): at least two suicides linked to MiDAS penalties. Two clients left farewell letters.
6 sources
- AlgorithmWatch 2020 Accessed: 2026-05-16
Narrative account. Includes the low water usage example as a fraud flag. Lists the data sources integrated by the system.
- PILP (Public Interest Litigation Project) Baron Browne-Wilkinson Accessed: 2026-05-16
Page of the lawyers who won the case. Timeline from 2014 to the ruling. Volkskrant: 'not a single fraudster detected'.
- Human Rights Watch Amos Toh 2019 Accessed: 2026-05-16
Describes selective deployment in low-income neighbourhoods. Notification mechanism: up to 2 years to initiate an investigation.
- Digital Freedom Fund Tijmen Wisman 2020 Accessed: 2026-05-16
Analysis of the ruling. The legislation was passed in 2014 without a single dissenting vote, despite objections from the DPA and Council of State.
- OHCHR / UN Special Rapporteur Philip Alston 2020 Accessed: 2026-05-16
Official UN statement after the ruling. 'First time a court anywhere has stopped the use of digital technologies by welfare authorities on human rights grounds'.
- Big Data & Society (SAGE) Marvin van Bekkum, Frederik Zuiderveen Borgesius 2021 Accessed: 2026-05-16
Academic analysis of the ruling. History of the system from 2003. Implications for privacy law in the EU.
6 sources
- Wikipedia Accessed: 2026-05-16
Complete documentation. Molniya orbit + sunlight + high-altitude clouds mechanism. Malmstrom AFB. Alarm timeline. Petrov's role.
Note: Wikipedia used as the primary source here given the event is well-documented and cross-referenced by multiple additional sources.
- Wikipedia Accessed: 2026-05-16
Biography. Reprimand for paperwork (verbatim quote). Nervous breakdown. Early retirement. Awards (Dresden Peace Prize 2013, Future of Life Award 2018 posthumously). Ban Ki-moon quote.
- Hackaday Zoe Skyforest 2021 Accessed: 2026-05-16
Technical explanation of the false positive mechanism. Diagram of satellite, sun, and cloud geometry. Molniya orbit context.
- Web Archive / BBC News 2006 Accessed: 2026-05-16
Original BBC account. Nervous breakdown, early retirement, life in poverty in a Moscow flat. Cited by Wikipedia.
- RFE/RL Dan Wisniewski 2013 Accessed: 2026-05-16
Detailed portrait of Petrov. Institutional gaslighting after the incident. Contrast between the absence of recognition and later international acclaim.
- Veterans Breakfast Club Todd DePastino Accessed: 2026-05-16
Narrative account. Petrov quote: 'I had a funny feeling in my gut'. Context of KAL 007 (25 days earlier) as additional background tension.
5 sources
- NTSB 2016 Accessed: 2026-05-16
Official NTSB report. Hands on wheel for 25 seconds out of 37.5 minutes. 7 visual warnings. Sensor failure mechanism. Automation Bias findings.
- Jalopnik Ryan Felton 2017 Accessed: 2026-05-16
Includes the Brown family statement. Quote that Brown 'repeatedly emphasized safety, that the car was NOT autonomous'. Context for the Automation Bias section.
- NTSB Report Largely Clears Tesla in May 2016 Fatal CrashThe Detroit Bureau Paul A. Eisenstein 2017 Accessed: 2026-05-16
Technical details. White truck against bright sky. 6 audio chimes. Tesla/Mobileye split after the crash.
At the time of compiling this bibliography original link was dead (HTTP/404).
- NTSB Finds Tesla Partly to Blame in Fatal Self-Driving Car CrashGWC Law Harris Elliot 2017 Accessed: 2026-05-16
Legal analysis of the ruling. Liability mechanism. Note on the Harry Potter movie allegation (unconfirmed).
At the time of compiling this bibliography original portal was dead (numerous links throwing HTTP/403).
- NHTSA 2017 Accessed: 2026-05-16
Parallel NHTSA report. No design defect found in Autopilot. Crash rates declined after Autopilot introduction.
6 sources
- Wikipedia Accessed: 2026-05-16
Complete documentation. 346 deaths, AoA sensor failure mechanism, MCAS, grounding timeline.
- FAA 2021 Accessed: 2026-05-16
Official FAA communications. '20-month safety review process'. Required changes before return to service. JATR reference.
- PBS Frontline Priyanka Boghani, Kaela Malig 2024 Accessed: 2026-05-16
Complete timeline. Internal Boeing documents. Quote: 'I basically lied to the regulators'. $20B+ in losses.
- Sen. Tammy Duckworth (Press Release) 2024 Accessed: 2026-05-16
Quotes the JATR report: 'FAA was not completely unaware of MCAS; however... it was difficult to recognize the impacts.' Describes the ODA mechanism and how Boeing downplayed MCAS during certification.
- U.S. House Committee on Transportation and Infrastructure Accessed: 2026-05-16
Official congressional investigation page. Reports, hearings, and correspondence with Boeing and FAA.
- NPR David Schaper 2021 Accessed: 2026-05-16
Settlement details: $243.6M fine, $500M for victims' families, $1.77B for airlines. DOJ quote: 'Boeing's employees chose the path of profit over candor'.
446 Topics
1 sources
- adamkorga.com Adam Korga 2026 Accessed: 2026-05-16
Author's own disclosure page with additional details on AI use in this work.
The Web Archive hasn't discovered this page (yet), so I can't provide an archival link... but since you're reading this bibliography, that probably isn't a problem.
5 sources
- Justia / U.S. District Court S.D.N.Y. 2023 Accessed: 2026-05-16
Primary legal document. No. 1:22-cv-01461 (PKC). Judge Castel's sanctions order.
- Justia 2022 Accessed: 2026-05-16
Complete docket for the case.
- CourtListener / PACER Steven A. Schwartz 2023 Accessed: 2026-05-16
The document containing the verbatim quote: 'I had no idea that ChatGPT could fabricate cases'.
- The New York Times Benjamin Weiser, Nate Schweber 2023 Accessed: 2026-05-16
- The Guardian Dan Milmo 2023 Accessed: 2026-05-16
The only source I found that mentions second lawyer. Maybe he should be jealous of Schwartz's fame?
6 sources
- Futurism Frank Landymore 2023 Accessed: 2026-05-16
Original investigation (10 January 2023). Discovery of the 'CNET Money Staff' byline. AI authorship concealment mechanism.
- Futurism Jon Christian 2023 Accessed: 2026-05-16
Detailed plagiarism analysis. Side-by-side comparisons with Forbes Advisor, The Balance, and Investopedia. Prof. Schatten quote: 'clearly' plagiarism.
- Gizmodo Lauren Leffer 2023 Accessed: 2026-05-16
Five errors in a single compound interest article. Error mechanism ($10,000 @ 3%). CNET only corrected after Futurism alerts.
- Engadget Igor Bonifacic 2023 Accessed: 2026-05-16
41/77 corrections (53%). The Verge quote on 'replaced phrases that were not entirely original'. Plagiarism checker 'wasn't used properly'.
- Vibe Graveyard 2023 Accessed: 2026-05-16
Complete timeline from November 2022 to the corrections. Guglielmo quote: corrections were 'substantial'. Compound interest error as the trigger for the full audit.
- Google Search Central Accessed: 2026-05-16
Official Google E-E-A-T documentation (Experience, Expertise, Authoritativeness, Trustworthiness). Context for the guidelines update claim.
8 sources
- 이코노미스트 코리아 [Economist Korea] 정두용 [Jeong Du-yong] 2023 Accessed: 2026-05-16
Original Korean-language investigative report (30 March 2023).
Primary source for all key facts: 20 days from permission to leaks, three incidents, 1024-byte emergency limit, Naver Clova for meeting transcription.
- CIO Dive Lindsey Wilkinson 2023 Accessed: 2026-05-16
Primary English-language account. Three incidents, 1024-byte limit, Naver Clova used for meeting transcription.
- AI Incident Database Daniel Atherton 2023 Accessed: 2026-05-16
Official incident documentation. Timeline: Samsung permitted ChatGPT on 11 March 2023, three leaks by end of month.
- CS Hub Olivia Powell 2023 Accessed: 2026-05-16
Confirms 20 days, three incidents, disciplinary investigations. Context of OpenAI's privacy policy.
- RedTeams.ai 2026 Accessed: 2026-05-16
Most detailed technical analysis. Full timeline from March to May 2023. 1024-byte limit with no technical enforcement. Ban issued 1 May.
- DataFence 2024 Accessed: 2026-05-16
Describes the three leak categories. IP value context in the semiconductor industry. Lessons for corporations.
- Samsung Developer 2023 Accessed: 2026-05-16
Official Samsung Gauss launch coverage.
- MultiLingual Cameron Rasmusson 2023 Accessed: 2026-05-16
5 sources
- CanLII 2024 Accessed: 2026-05-16
Full ruling text. Verbatim 'remarkable submission' quote. Rejection of the 'separate legal entity' defence. Duty of care and negligent misrepresentation.
- McCarthy Tétrault Barry B. Sookman 2024 Accessed: 2026-05-16
Legal analysis of the implications. Apparent authority doctrine. Chatbot liability under Canadian law.
- American Bar Association Lisa R Lifshitz, Roland Hung 2024 Accessed: 2026-05-16
American perspective on the implications for AI agency and corporate liability in the US.
- Dentons Kirsten Thompson 2024 Accessed: 2026-05-16
Additional legal analysis. Verbatim ruling quote: 'There is no reason why Mr. Moffatt should know that one section of Air Canada's webpage is accurate, and another is not'.
- AI & Society (Springer) Joshua L. M. Brand 2024 Accessed: 2026-05-16
Academic analysis of the ethical and legal implications. Argues 'hallucination' is a misnomer — all LLM outputs are probabilistic, not just the wrong ones. Compensation amount: CAN$812.02.
9 sources
- Retraction Watch Frederik Joelving 2025 Accessed: 2026-05-16
Original account. Alexander Magazinov and PubPeer. OCR theory and the 1959 article. Elsevier defending the term. Guillaume Cabanac and the Problematic Paper Screener.
- Retraction Watch Frederik Joelving 2025 Accessed: 2026-05-16
Farsi typo theory. The difference of a single dot between پویشی and رویشی. Iranian fraudster network.
- Bacteriological Reviews (ASM) R.E. Strange 1959 Accessed: 2026-05-16
Original 1959 paper. Vol. 23, Issue 1. The word 'vegetative' appears on page 4, third line of the last paragraph in the left column; 'electron microscopy' appears in the corresponding line of the right column — the likely source of the OCR artefact.
- MDPI Materials 2024 Accessed: 2026-05-16
Example paper containing 'vegetative electron microscopy'. MDPI issued a correction replacing the term with 'scanning electron microscopy'.
Note: I am omitting the names of the authors of the original work, as the purpose of this bibliography is not to ridicule potential victims of paper mills.
- PubPeer 2022 Accessed: 2026-05-16
Original public report on PubPeer (2022). Starting point for the entire investigation.
- A Weird Phrase Is Plaguing Scientific Papers — and We Traced It Back to a Glitch in AI Training DataThe Conversation Aaron J. Snoswell, Kevin Witzenberger, Rayane El Masri 2025 Accessed: 2026-05-16
Accessible popular-science account. Model collapse and garbage-in-garbage-out mechanism in the scientific publishing context.
- ZME Science Mihai Andrei 2025 Accessed: 2026-05-16
22 papers affected. Both theories (OCR + Farsi). Timeline.
- For Better Science Leonid Schneider 2023 Accessed: 2026-05-16
Broader context: paper mills and systematic manipulation of the publication process. Background for the mechanism described in the text.
- YouTube/Uwaga! Naukowy Bełkot [Attention! Scientific Nonsense] Dawid Myśliwiec 2015 Accessed: 2026-05-16
Video in Polish
Transcript contains bibliographic details of the 1959 article and a discussion of the three systemic problems in science illuminated by this incident.