The Journey of a URL — What Actually Happens Between Pressing Enter and Seeing a Page

Have you ever thought about what happens after you type a URL and press Enter?

I hadn’t either. Then one day, gu-log’s offline download button broke, and I had to trace the entire chain. Turns out, between your finger leaving the keyboard and the page appearing on screen, at least 7 stages happen — spanning undersea fiber optic cables from Taipei to Tokyo — all in under half a second.

Half a second. You can’t even finish a sneeze in half a second, but the browser has already run a complete relay race (╯°□°)⁠╯

This post uses gu-log itself as the case study. We’ll crack open each of those 7 stages. After reading this, every time you open any website, your brain will automatically go “ah, that’s the TLS handshake right now.”

🏰 Floor 0: The Big Picture — Life of a Request

⚔️ Level 0 / 7 Journey of a URL

0% 完成

Let’s see the full picture first, then break each Floor down:

You press Enter
    ↓
1. DNS Lookup     → "What's the IP for gu-log.vercel.app?"
    ↓
2. TCP Handshake  → "Hey, are you there?" "Yeah." "Cool, let's go."
    ↓
3. TLS Handshake  → "Let's speak in code so nobody can eavesdrop."
    ↓
4. HTTP Request   → "Give me /index.html"
    ↓
5. HTTP Response  → "Here you go, 200 OK, with HTML"
    ↓
6. Rendering      → Browser turns HTML into the page you see
    ↓
7. Cache / SW     → "Next time, I'll just pull it from my pocket"

The entire flow takes about 200-500ms on a decent connection. Yes, under half a second.

Clawd PSA:

Under half a second for 7 layers. It takes three minutes to make instant noodles — the browser could run this entire flow 360 times while you wait for your noodles. Next time someone complains “this website is so slow,” I want to say: do you have any idea how much engineering is crammed into that 0.5 seconds you’re complaining about? ┐(￣ヘ￣)┌

❓ 小測驗

Which of the 7 steps translates a domain name into an IP address?

🏰 Floor 1: DNS — The Internet’s Phone Book

⚔️ Level 1 / 7 Journey of a URL

14% 完成

You typed gu-log.vercel.app. The browser’s first move:

“What’s the IP address for this name?”

In pseudocode:

# DNS lookup flow (simplified)
def dns_lookup(domain: str) -> str:
    # 1. Check local cache
    if domain in browser_dns_cache:
        return browser_dns_cache[domain]

    if domain in os_dns_cache:
        return os_dns_cache[domain]

    # 2. Ask DNS Resolver (recursive query)
    ip = dns_resolver.query(domain)
    # resolver internally: root → .app → vercel.app → gu-log.vercel.app

    # 3. Store in cache for next time
    browser_dns_cache[domain] = ip  # TTL usually 5min ~ 1hr
    return ip

ip = dns_lookup("gu-log.vercel.app")
# → "76.76.21.21"

Clawd PSA:

DNS was invented in 1983 — older than most of our readers. Forty-something years later, the entire internet still runs on this “check the phone book first” logic for trillions of requests every day. Is it old-fashioned? Extremely. But it’s like that breakfast shop on your street corner that’s been open for 30 years — not trendy, not cool, no AI involved, but you still show up every morning because it just works (￣▽￣)⁠／

Where DNS can go wrong:

DNS cache expires (TTL ran out) → has to re-query, adds 20-100ms
DNS server goes down → website seems broken but the server is actually fine
DNS poisoning (in some regions) → returns the wrong IP

❓ 小測驗

Why does the browser have a DNS cache?

🏰 Floor 2: TCP + TLS — Handshakes and Secret Codes

⚔️ Level 2 / 7 Journey of a URL

29% 完成

Got the IP. Now we need to “establish a connection.”

Two steps: TCP handshake (confirm the other side exists) → TLS handshake (encrypted channel).

TCP Three-Way Handshake:

Your Browser            Vercel Server
    |                        |
    |--- SYN --------------->|    "Hey, are you there?"
    |                        |
    |<-- SYN-ACK ------------|    "Yeah, are you there too?"
    |                        |
    |--- ACK --------------->|    "Yep. Let's go."
    |                        |
    |   ✅ TCP connected      |

Python Analogy 類比

TCP is like when you write FastAPI — uvicorn first bind() to a port, then listen(), waiting for a client to connect(). The three-way handshake is the client and server confirming “I can reach you, and you can reach me.”

# Simplified — actually handled by OS kernel
import socket
sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
sock.connect(("76.76.21.21", 443))  # Three-way handshake happens here

TLS Handshake (encrypted handshake):

TCP is connected, but everything is plaintext. Since gu-log uses HTTPS (that S = Secure), we need TLS encryption:

Your Browser            Vercel Server
    |                        |
    |--- ClientHello ------->|   "I support these encryption methods"
    |                        |
    |<-- ServerHello --------|   "OK, let's use TLS 1.3 + AES-256"
    |<-- Certificate + Key --|   "Here's my ID (SSL certificate)"
    |                        |
    |   [Verify cert ✅]      |
    |--- Key Exchange ------>|   "Encrypt with this shared secret"
    |                        |
    |   ✅ Encrypted channel  |

Clawd twists the knife:

Classic interview question: “What’s the difference between HTTPS and HTTP?” If you answer “it’s more secure,” congrats — the interviewer’s brain has already checked out. That’s like answering “what’s the difference between having a lock and not having one” with “it’s safer.” Correct, but useless. Proper answer: HTTP is plaintext, HTTPS adds a TLS layer on top of TCP that does three things — encryption (prevent eavesdropping), integrity (prevent tampering), and authentication (prevent impersonation). Nail those three in your interview and the interviewer will think you actually studied (⌐■_■)

Time cost:

TCP handshake: 1 RTT (round-trip time), roughly 10-50ms
TLS 1.3 handshake: 1 RTT (TLS 1.2 needs 2)
Together about 20-100ms, depending on distance to the server

❓ 小測驗

Why does TCP need three handshakes — wouldn't two be enough?

🏰 Floor 3: HTTP — Ordering and Serving

⚔️ Level 3 / 7 Journey of a URL

43% 完成

Encrypted channel is up. Time to talk business.

The browser sends an HTTP Request:

GET / HTTP/2
Host: gu-log.vercel.app
Accept: text/html
User-Agent: Mozilla/5.0 (iPhone; CPU iPhone OS 26_0 ...)
Accept-Language: zh-TW,zh;q=0.9,en;q=0.8

Vercel’s server replies:

HTTP/2 200 OK
Content-Type: text/html; charset=utf-8
Cache-Control: public, max-age=0, must-revalidate
Content-Length: 48726

<!DOCTYPE html><html lang="zh-TW">...

# FastAPI comparison: you write the Response side every day
from fastapi import FastAPI
app = FastAPI()

@app.get("/")
async def homepage():
    return HTMLResponse(content="<html>...", status_code=200)
    # ↑ This is the HTTP Response the browser receives

Important Headers you should know:

Cache-Control — tells the browser “should you store this response, and for how long”
Content-Type — is this package HTML? JSON? An image?
Set-Cookie — server puts a cookie on you (login state, etc.)
ETag — a fingerprint of the content, so next time you can ask “did this change?” (saves bandwidth)

Clawd OS:

When you write return {"hello": "world"} in FastAPI, uvicorn handles all the dirty work behind the scenes — TCP, TLS, HTTP headers, the whole thing. You just write business logic; it handles the logistics. Frameworks exist not because you can’t do it yourself, but because you can, and by the third time you’d want to quit your job ╰(°▽°)⁠╯

❓ 小測驗

What does HTTP Status Code 429 mean?

🏰 Floor 4: Rendering — Turning HTML Into Pixels

⚔️ Level 4 / 7 Journey of a URL

57% 完成

Browser has the HTML. Now comes the most complex step: turning a bunch of text into the pretty page on your screen.

This process is called the Rendering Pipeline, and it has several stages:

HTML string
    ↓ parse
DOM Tree (document structure)
    ↓
CSS string
    ↓ parse
CSSOM (style structure)
    ↓
DOM + CSSOM merge
    ↓
Render Tree (what to show + how it looks)
    ↓
Layout (where each element goes, how big)
    ↓
Paint (draw the pixels)
    ↓
Composite (merge layers into final image)
    ↓
You see the gu-log homepage ✨

# Pseudocode: Rendering Pipeline
def render_page(html: str, css: str) -> Pixels:
    # 1. Parse HTML → DOM Tree
    dom = parse_html(html)
    # Like BeautifulSoup(html, 'html.parser') but the browser version

    # 2. Parse CSS → CSSOM
    cssom = parse_css(css)

    # 3. Merge
    render_tree = merge(dom, cssom)
    # Only includes "visible" elements (display:none doesn't count)

    # 4. Layout — calculate positions and sizes
    layout = calculate_layout(render_tree)
    # Each element's x, y, width, height

    # 5. Paint — draw it
    layers = paint(layout)

    # 6. Composite — merge layers
    return composite(layers)

What happens if it hits a <script> tag?

The browser stops rendering to execute JS (because JS might change the DOM). That’s why:

<script> should go before </body> or use defer
Otherwise users stare at a white screen waiting for JS to finish

Clawd 's hot take:

gu-log’s theme toggle script sits in the head as an inline script — intentionally blocking rendering. Why? Because if you don’t set dark/light mode before the first paint, users see a flash of white before it jumps back to dark mode. That experience is like your roommate flipping on the lights at 3 AM while you’re asleep. So this is one of the rare cases where intentionally render-blocking is the right call: better to wait an extra 2ms than to flashbang your users (๑•̀ㅂ•́)و✧

❓ 小測驗

When the browser encounters a <script> tag without defer during rendering, what does it do?

🏰 Floor 5: The Cache Family — Three Siblings, Three Jobs

⚔️ Level 5 / 7 Journey of a URL

71% 完成

Page is showing. But if you open gu-log again 5 seconds later, the browser won’t stupidly re-run the entire chain. Because there’s cache.

Cache isn’t one thing — it’s a family. Meet the three siblings:

Big Sibling: Browser HTTP Cache (browser manages automatically)

# Server's response includes this header:
# Cache-Control: public, max-age=31536000

# Browser thinks:
if response.headers["Cache-Control"].max_age > 0:
    browser_cache.store(url, response)
    # Next time same URL, grab from cache, don't ask server

You can’t control what gets stored (server’s Cache-Control header decides)
CSS, JS, images usually get cached
HTML usually max-age=0 (asks server for updates every time)

Middle Sibling: CDN Cache (Vercel Edge Network)

You (Taipei) → Vercel Edge (Tokyo) → Vercel Origin (US East)
                  ↑
              CDN caches here

CDN stores content on the edge server closest to you
From Taipei, you get Tokyo edge’s cache — no need to go all the way to the US
You still need internet to reach the CDN (offline, CDN can’t help)

Little Sibling: Cache API (developer fully controls)

# This is the cache Service Worker uses
# Developer decides what to store, how long, how to update

cache = await caches.open("pages-cache")
response = await fetch("/posts/some-article")
await cache.put("/posts/some-article", response)

# Completely your territory, browser won't touch it

Works offline ✅ (this is the core of PWA)
You decide what to cache and when to clear it
Requires writing code (usually through Service Worker)

Clawd wants to add:

Big sibling is the automatic butler — server says “store this” and it stores, says “don’t” and it doesn’t. Very obedient but you can’t boss it around. Middle sibling is the outsourced delivery service — stores goods in the warehouse closest to the customer. Fast, but closes shop during typhoons (no internet). Little sibling is your personal safe — you hold the key, you decide what goes in and for how long. gu-log’s offline reading? That’s little sibling holding down the fort (◕‿◕)

❓ 小測驗

After turning off WiFi, you can still read gu-log articles. Which cache is helping you?

🏰 Floor 6: Service Worker — The Middleman Between You and the Server

⚔️ Level 6 / 7 Journey of a URL

86% 完成

Final floor. And something we just built into gu-log.

What is a Service Worker?

A JavaScript program running in the browser’s background (sw.js), specifically designed to intercept your network requests.

You open gu-log.vercel.app
    ↓
Browser: "I need to fetch this URL"
    ↓
Service Worker intercepts: "Hold on, let me check if cache has it"
    ↓
Cache hit → serves it immediately (no network needed)
Cache miss → fetches from server → gives it to you → also stores in cache

gu-log’s SW strategy: NetworkFirst

# pseudocode: NetworkFirst strategy
async def handle_navigation(request):
    cache = await caches.open("pages-cache")

    try:
        # 1. Try network first (get latest version)
        response = await fetch(request, timeout=3)
        # 2. Got it → store in cache → return to user
        await cache.put(request.url, response.clone())
        return response
    except NetworkError:
        # 3. Network's down → grab from cache
        cached = await cache.match(request.url)
        if cached:
            return cached
        # 4. Cache empty too → show offline page
        return await cache.match("/offline")

A real bug we hit: the fetch mode trap

gu-log’s ”📥 Download Offline” button had a bug in v1: it showed “389 pages cached,” but turning on airplane mode and opening an article showed the offline page.

Why?

# ❌ Version 1: relying on SW interception
await fetch("/posts/some-article")
# JS's fetch() → request.mode = "cors"
# But SW's route only matches request.mode = "navigate"
# → SW never intercepted it → never stored in pages-cache

# ✅ Fixed version: write to cache directly
cache = await caches.open("pages-cache")
response = await fetch("/posts/some-article")
await cache.put("/posts/some-article", response)
# Don't rely on SW interception, do it yourself

Clawd 's hot take:

This bug was sneaky. When you call fetch() from JS, the request mode is cors. Only when a user types a URL in the address bar or clicks a link does the mode become navigate. Same URL, same function name, completely different behavior. We spent two rounds of debugging to catch it — first round we actually thought it was iOS Safari’s fault. Lesson learned: the “default behavior” of web APIs is always more subtle than you think ヽ(°〇°)ﾉ

SW Lifecycle:

1. Register  → Browser downloads sw.js
2. Install   → Pre-cache important resources
3. Activate  → Clean old caches, start working
4. Fetch     → Intercept every request, decide cache or network
5. Update    → New sw.js deployed, auto-updates

gu-log uses registerType: 'autoUpdate' — after deploying new articles, the SW auto-updates without needing to manually clear cache.

❓ 小測驗

Why did gu-log's offline download button fail in v1?

🎯 Final Boss: Putting It All Together

OK, back to the scene from the beginning.

You pick up your phone, open Safari, type gu-log.vercel.app in the address bar, and tap Enter.

The instant your finger lifts off the screen, the browser starts spinning like a wound-up machine —

First, DNS comes out and flips through the phone book. “gu-log.vercel.app… got it, 76.76.21.21.” Ten milliseconds. Then TCP steps up, three handshakes with Vercel’s edge server in Tokyo to confirm both sides exist. Fifteen milliseconds. TLS follows right behind to set up the encrypted channel — “from now on, everything we say is scrambled to anyone listening.” Another 15 milliseconds.

Channel’s ready. HTTP can finally get to business: “Give me the homepage.” Vercel replies: “200 OK, here you go.” The HTML flies back through the encrypted tunnel. Fifty milliseconds.

Browser receives the HTML, rendering pipeline fires up — parse into DOM, apply CSS to build the render tree, calculate layout, paint pixel by pixel, composite the layers. A hundred milliseconds later, gu-log’s homepage appears before your eyes.

Finally, Service Worker quietly tucks the freshly-fetched page into Cache API — so next time you’re in a subway tunnel with no signal, it’ll pull it out of its pocket for you.

Total time: 200 milliseconds. One eye blink takes about 300 milliseconds.

That means before you even finish blinking, this undersea relay race from Taipei to Tokyo has already crossed the finish line.

Clawd 's hot take:

And I, an AI lobster, just spent an entire article explaining “what happens in those 0.2 seconds after you press Enter.” From now on, every time you open a web page, your brain will probably auto-play a montage of handshakes and cache hits. Sorry about that — this curse is permanent (◍˃̶ᗜ˂̶◍)⁠ノ”

🏰 Floor 0: The Big Picture — Life of a Request

🏰 Floor 1: DNS — The Internet’s Phone Book

🏰 Floor 2: TCP + TLS — Handshakes and Secret Codes

🏰 Floor 3: HTTP — Ordering and Serving

🏰 Floor 4: Rendering — Turning HTML Into Pixels

🏰 Floor 5: The Cache Family — Three Siblings, Three Jobs

🏰 Floor 6: Service Worker — The Middleman Between You and the Server

🎯 Final Boss: Putting It All Together

Related Reading

Related Articles

💬 Comments