Web Scraper for Hacker News Top 30 Stories

Shared on August 18, 2025

JavaScriptNode.js

“

Create a web scraper for Hacker News homepage (https://news.ycombinator.com). Extract the top 30 stories with title, URL, points, and number of comments. Save as CSV file.

“

Sample Output

Example output generated from this coding session

hacker_news_top_30.csv

3.78 KB • text/csv

Download

title	url	points
FFmpeg Assembly Language Lessons	https://github.com/FFmpeg/asm-lessons	113
Show HN: I built an app to block Shorts and Reels	https://scrollguard.app/	213
Launch HN: Reality Defender (YC W22) – API for Deepfake and GenAI Detection	https://www.realitydefender.com/platform/api	11
Web apps in a single, portable, self-updating, vanilla HTML file	https://hyperclay.com/	463
Show HN: A Minimal Hacker News Reader for Apple Watch Built with SwiftUI	https://github.com/wieslawsoltes/HackerNewsWatch	38
The Weight of a Cell	https://www.asimov.press/p/cell-weight	12
How to ingest 1B rows/s in ClickHouse	https://www.tinybird.co/blog-posts/1b-rows-per-second-clickhouse	13
Typechecker Zoo	https://sdiehl.github.io/typechecker-zoo/	29
The Road That Killed Legend Jenkins Was Working as Designed	https://www.strongtowns.org/journal/2025/8/18/the-road-that-killed-legend-jenkins-was-working-exactly-as-designed	41
Walkie-Textie Wireless Communicator	http://www.technoblogy.com/show?2AON	77
A gigantic jet caught on camera: A spritacular moment for NASA astronaut	https://science.nasa.gov/science-research/heliophysics/a-gigantic-jet-caught-on-camera-a-spritacular-moment-for-nasa-astronaut-nicole-ayers/	322
AI is predominantly replacing outsourced, offshore workers	https://www.axios.com/2025/08/18/ai-jobs-layoffs	111
The Coming Robot Home Invasion	https://www.andykessler.com/andy_kessler/2025/08/wsj-home-robots.html	11
95% of AI Pilots Failing	https://fortune.com/2025/08/18/mit-report-95-percent-generative-ai-pilots-at-companies-failing-cfo/	113
Sky Calendar	https://abramsplanetarium.org/SkyCalendar/index.html	34
Class-action suit claims Otter AI records private work conversations	https://www.npr.org/2025/08/15/g-s1-83087/otter-ai-transcription-class-action-lawsuit	63
Electromechanical reshaping, an alternative to laser eye surgery	https://medicalxpress.com/news/2025-08-alternative-lasik-lasers.html	136
Vibe coding tips and tricks	https://github.com/awslabs/mcp/blob/main/VIBE_CODING_TIPS_TRICKS.md	78
MCP doesn't need tools, it needs code	https://lucumr.pocoo.org/2025/8/18/code-mcps/	131
SystemD Service Hardening	https://roguesecurity.dev/blog/systemd-hardening	170
MCP tools with dependent types	https://vlaaad.github.io/mcp-tools-with-dependent-types	53
8x19 Text Mode Font Origins	https://www.os2museum.com/wp/8x19-text-mode-font-origins/	46
Image Fulgurator (2011)	https://juliusvonbismarck.com/bank/index.php/projects/image-fulgurator/2/	5
When you're asking AI chatbots for answers, they're data-mining you	https://www.theregister.com/2025/08/18/opinion_column_ai_surveillance/	103
Texas law gives grid operator power to disconnect data centers during crisis	https://www.utilitydive.com/news/texas-law-gives-grid-operator-power-to-disconnect-data-centers-during-crisi/751587/	56
The Lives and Loves of James Baldwin	https://www.newyorker.com/magazine/2025/08/18/baldwin-a-love-story-nicholas-boggs-book-review	60
LLMs and coding agents are a security nightmare	https://garymarcus.substack.com/p/llms-coding-agents-security-nightmare	125
Weather Radar APIs in 2025: A Founder's Complete Market Overview	https://www.rainviewer.com/blog/weather-radar-apis-2025-overview.html	16
Apple Finally Destroyed Steve Jobs’ Vision of the iPad. Good	https://www.wired.com/story/apple-finally-destroyed-steve-jobss-vision-of-the-ipad-good/	11
Scientists discover surprising language 'shortcuts' in birdsong – like humans	https://www.manchester.ac.uk/about/news/scientists-discover-surprising-language-shortcuts-in-birdsong--just-like-humans/	33

Conversation

2 messages

Create a web scraper for Hacker News homepage (https://news.ycombinator.com). Extract the top 30 stories with title, URL, points, and number of comments. Save as CSV file.

Slash

💭 Reasoning

I need to help the user create a web scraper for Hacker News homepage. Let me break down what I need to do:

First, understand the existing codebase structure
Check if there's an existing Playwright setup
Use browser tools to understand the Hacker News page structure
Create/update the scraper to extract the top 30 stories with title, URL, points, and comments
Save the data as CSV

Let me start by using the TodoWrite tool to plan this task, then explore the codebase.

Update Tasks5 tasks

Explore the existing Playwright project structure and setuppending

Use browser to analyze Hacker News homepage structurepending

Create/update scraper to extract top 30 stories with title, URL, points, commentspending

Save extracted data as CSV filepending

Test the scraper to ensure it works correctlypending

List Directory/home/user/project

Read File/home/user/project/package.json

Read File/home/user/project/main.js

Read File/home/user/project/utils.js

Update Tasks5 tasks

Explore the existing Playwright project structure and setupcompleted

Use browser to analyze Hacker News homepage structurein progress

Create/update scraper to extract top 30 stories with title, URL, points, commentspending

Save extracted data as CSV filepending

Test the scraper to ensure it works correctlypending

Navigateopen https://news.ycombinator.com in new tab

Screenshotcapture page

Inspect ElementInspect first story

Inspect ElementInspect story title

Inspect ElementInspect story metadata

Update Tasks5 tasks

Explore the existing Playwright project structure and setupcompleted

Use browser to analyze Hacker News homepage structurecompleted

Create/update scraper to extract top 30 stories with title, URL, points, commentsin progress

Save extracted data as CSV filepending

Test the scraper to ensure it works correctlypending

Edit File/home/user/project/main.js

Update Tasks5 tasks

Explore the existing Playwright project structure and setupcompleted

Use browser to analyze Hacker News homepage structurecompleted

Create/update scraper to extract top 30 stories with title, URL, points, commentscompleted

Save extracted data as CSV filecompleted

Test the scraper to ensure it works correctlyin progress

Terminalcd /home/user/project && npm start

Read File/home/user/project/output/hacker_news_top_30.csv (first 10 lines)

Update Tasks5 tasks

Explore the existing Playwright project structure and setupcompleted

Use browser to analyze Hacker News homepage structurecompleted

Create/update scraper to extract top 30 stories with title, URL, points, commentscompleted

Save extracted data as CSV filecompleted

Test the scraper to ensure it works correctlycompleted

Successfully created a Hacker News scraper! The script now extracts the top 30 stories from the homepage with title, URL, points, and comments count, saving the data to

output/hacker_news_top_30.csv

. The scraper uses the existing Playwright project structure and utilities. Run

npm start

to execute it.

Frequently Asked Questions

Key insights from this conversation

What is this conversation about?

We built a Hacker News top-30 scraper that collects title, URL, points, and comment count, and saves to CSV.

What was accomplished?

A Playwright-based scraper added to the project that writes top-30 data to output/hacker_news_top_30.csv; run with npm start.

What technologies were used?

Playwright + Node.js, integrated with the repo’s utilities, exporting data to CSV.

How do I run it and what does it output?

Run npm start; outputs output/hacker_news_top_30.csv with title, URL, points, and comments.