References
Python Libraries
[1] Richardson, L. Beautiful Soup 4 Documentation. 2004–present. https://www.crummy.com/software/BeautifulSoup/bs4/doc/
[2] Encode. HTTPX: A next-generation HTTP client for Python. 2019–present. https://www.python-httpx.org/
[3] lxml project. lxml: Processing XML and HTML with Python. 2005–present. https://lxml.de/
[4] Microsoft. Playwright for Python. 2020–present. https://playwright.dev/python/
[5] Python Software Foundation. asyncio: Asynchronous I/O. Python 3 Standard Library. 2013–present. https://docs.python.org/3/library/asyncio.html
[6] Scrapy project / Zyte. Scrapy Documentation. 2008–present. https://docs.scrapy.org/en/latest/
[7] Selenium project. Selenium WebDriver Documentation. 2004–present. https://www.selenium.dev/documentation/webdriver/
[8] Gronholm, A. Advanced Python Scheduler (APScheduler) Documentation. 2009–present. https://apscheduler.readthedocs.io/en/3.x/
[9] Celery project. Celery: Distributed Task Queue Documentation. 2009–present. https://docs.celeryq.dev/en/stable/
[10] MagicStack. asyncpg: A fast PostgreSQL database client library for Python/asyncio. 2016–present. https://magicstack.github.io/asyncpg/current/
[11] Zope Foundation. RestrictedPython Documentation. 2002–present. https://restrictedpython.readthedocs.io/en/latest/
[12] Kumar, S. python-dotenv: Read key-value pairs from a .env file. 2014–present. https://github.com/theskumar/python-dotenv
[13] Tahri, A. R. charset-normalizer: A modern encoding detector. 2019–present. https://charset-normalizer.readthedocs.io/
AI and LLM Tools
[14] OpenRouter. OpenRouter API Documentation: Quickstart Guide. 2023–present. https://openrouter.ai/docs/quickstart
[15] Anthropic. Model Context Protocol Specification (2025-11-25). 2024–present. https://modelcontextprotocol.io/specification/2025-11-25
[16] Lowin, J. FastMCP 2.0: The fast, Pythonic way to build MCP servers and clients. 2024–present. https://gofastmcp.com/v2/getting-started/welcome
[17] OpenAI. OpenAI Python API Library. 2020–present. https://github.com/openai/openai-python
Web Standards
[18] W3C CSS Working Group. Selectors Level 4. W3C Working Draft, 2022. https://www.w3.org/TR/selectors-4/
[19] W3C. Selectors API Level 1. W3C Recommendation, 2013. https://www.w3.org/TR/selectors-api/
[20] Koster, M., Illyes, G., Zeller, H., and Sassman, L. RFC 9309: Robots Exclusion Protocol. IETF Standards Track, 2022. https://datatracker.ietf.org/doc/html/rfc9309
[21] Bray, T. (Ed.). RFC 8259: The JavaScript Object Notation (JSON) Data Interchange Format. IETF Internet Standard (STD 90), 2017. https://datatracker.ietf.org/doc/html/rfc8259
[22] Fielding, R. and Reschke, J. RFC 7230: Hypertext Transfer Protocol (HTTP/1.1): Message Syntax and Routing. IETF Standards Track, 2014. https://datatracker.ietf.org/doc/html/rfc7230
[23] W3C XML Query Working Group. XML Path Language (XPath) 3.1. W3C Recommendation, 2017. https://www.w3.org/TR/xpath-31/
Books
[24] Mitchell, R. Web Scraping with Python: Data Extraction from the Modern Web (3rd ed.). O’Reilly Media, 2024. https://www.oreilly.com/library/view/web-scraping-with/9781098145347/
[25] Russell, M. A. and Klassen, M. Mining the Social Web (3rd ed.). O’Reilly Media, 2019. https://www.oreilly.com/library/view/mining-the-social/9781491973547/
[26] McKinney, W. Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter (3rd ed.). O’Reilly Media, 2022. https://www.oreilly.com/library/view/python-for-data/9781098104023/
Legal and Academic
[27] U.S. Court of Appeals, Ninth Circuit. hiQ Labs, Inc. v. LinkedIn Corp., No. 17-16783, 2022. https://law.justia.com/cases/federal/appellate-courts/ca9/17-16783/17-16783-2022-04-18.html
[28] U.S. Department of Justice. Justice Manual 9-48.000: Computer Fraud and Abuse Act. 1986 (statute); page current. https://www.justice.gov/jm/jm-9-48000-computer-fraud
[29] Legal Information Institute, Cornell Law School. 18 U.S. Code § 1030: Fraud and Related Activity in Connection with Computers. 1986 (statute); page current. https://www.law.cornell.edu/uscode/text/18/1030
Databases and Storage
[30] Hipp, D. R. et al. SQLite Documentation. 2000–present. https://sqlite.org/docs.html
[31] PostgreSQL Global Development Group. PostgreSQL Documentation (current release). 1996–present. https://www.postgresql.org/docs/current/index.html
[32] MongoDB, Inc. MongoDB Documentation. 2009–present. https://www.mongodb.com/docs/
[33] Redis Ltd. Redis Documentation. 2009–present. https://redis.io/docs/latest/
Tools and Services
[34] Microsoft. playwright-python: Playwright for Python (GitHub repository). 2020–present. https://github.com/microsoft/playwright-python
[35] Google. Puppeteer Documentation. 2017–present. https://pptr.dev/
[36] Marby, D. and Yonskai, N. Lorem Picsum: Lorem Ipsum for photos. 2017–present. https://picsum.photos/
[37] Docker, Inc. Docker Documentation. 2013–present. https://docs.docker.com/
[38] Project Jupyter. Project Jupyter Documentation. 2014–present. https://docs.jupyter.org/
[39] uv project (Astral). uv: An extremely fast Python package manager. 2023–present. https://docs.astral.sh/uv/
Articles and Blog Posts
[40] Anthropic. “Introducing the Model Context Protocol.” Anthropic News, November 2024. https://www.anthropic.com/news/model-context-protocol
[41] The New Stack. “Why the Model Context Protocol Won.” 2025. https://thenewstack.io/why-the-model-context-protocol-won/
[42] Cocca, G. “Server Side Rendering in JavaScript: SSR vs CSR Explained.” freeCodeCamp, 2023. https://www.freecodecamp.org/news/server-side-rendering-javascript/
[43] Prismic. “Client-side Rendering vs. Server-side Rendering.” 2024. https://prismic.io/blog/client-side-vs-server-side-rendering
[44] ScrapingBee. “A Web Scraper’s Guide to Robots.txt.” 2023. https://www.scrapingbee.com/blog/robots-txt-web-scraping/
[45] ScrapingBee. “Web Scraping Best Practices.” 2024. https://www.scrapingbee.com/blog/web-scraping-best-practices/
[46] Brosset, P. “The Truth About CSS Selector Performance.” Microsoft Edge Blog, January 2023. https://blogs.windows.com/msedgedev/2023/01/17/the-truth-about-css-selector-performance/
[47] Mozilla. “CSS Performance Optimization.” MDN Web Docs. 2023–present. https://developer.mozilla.org/en-US/docs/Learn_web_development/Extensions/Performance/CSS
[48] ScrapeHero. “How to Overcome Rate Limiting in Web Scraping: Complete Guide.” 2024. https://www.scrapehero.com/rate-limiting-in-web-scraping/
[49] Crockford, D. Introducing JSON. 2002–present. https://www.json.org/
[50] ZwillGen PLLC. “hiQ v. LinkedIn Wrapped Up: Web Scraping Lessons Learned.” 2022. https://www.zwillgen.com/alternative-data/hiq-v-linkedin-wrapped-up-web-scraping-lessons-learned/