OS Internals Interview Guide¶

Virtual Memory¶

→ src/systems/os/virtual_memory.cpp

Multi-level (x86_64: 4-level, PML4 → PDP → PD → PT)
Each entry: physical frame number + flags (present, writable, user, dirty, accessed)
Page walk: ~4 memory accesses without TLB

Step	What Happens
1	Save CPU registers (general, FP, SIMD) to kernel stack
2	Save TLB state / flush (if different address space)
3	Switch page tables (CR3 on x86)
4	Restore target process registers
5	Return to user-space

Cost: ~1-10μs (depends on TLB flush, cache cold start)

Algorithm	Type	Used In
CFS (Completely Fair Scheduler)	Weighted fair share, red-black tree	Linux default
RT (SCHED_FIFO / SCHED_RR)	Priority-based, preemptive	Real-time tasks
SCHED_DEADLINE	Earliest deadline first	Deadline-critical workloads
O(1) scheduler	Priority arrays, O(1) pick	Older Linux (pre-2.6.23)

→ src/systems/os/scheduler_sim.cpp

User-space → kernel transition via syscall instruction (x86_64)
Cost: ~100-200ns (register save + mode switch + validation)
vDSO: kernel-mapped into user-space for fast calls (clock_gettime, gettimeofday)
Minimize syscalls on hot path (batch I/O, io_uring)

Type	Source	Example
Hardware (IRQ)	Devices	NIC packet arrival, timer tick
Software (trap)	CPU instruction	`syscall`, `int 0x80`
Exception	CPU error	Page fault, divide by zero, GP fault

Interrupt flow: save context → IDT lookup → handler → iret

Question	Key Points
What happens on a page fault?	Check valid mapping → allocate frame → load from disk → update PT → restart instruction
Process vs thread?	Process = separate address space. Thread = shared address space, own stack/registers
What's a zombie process?	Terminated but parent hasn't called `wait()`. Entry remains in process table
Explain thrashing	Working set > physical memory → constant page faults → system crawls
Priority inversion	Low-priority thread holds lock needed by high-priority thread. Fix: priority inheritance
How does `malloc` work?	`brk`/`mmap` from kernel; user-space allocator manages free lists/arenas