Category Archives: _Deep Dive 🔍

Longer, more detailed posts that require significant research.

Linux Internals: How /proc/self/mem writes to unwritable memory

May 12, 2021: _Deep Dive 🔍, C, Favorite, Linux Kernel

Introduction

An obscure quirk of the /proc/*/mem pseudofile is its “punch through” semantics. Writes performed through this file will succeed even if the destination virtual memory is marked unwritable. In fact, this behavior is intentional and actively used by projects such as the Julia JIT compiler and rr debugger.

This behavior raises some questions: Is privileged code subject to virtual memory permissions? In general, to what degree can the hardware inhibit kernel memory access?

By exploring these questions¹, this article will shed light on the nuanced relationship between an operating system and the hardware it runs on. We’ll examine the constraints the CPU can impose on the kernel, and how the kernel can bypass these constraints.

Continue reading →

Open source licensing for supervillains

1 Reply

January 22, 2021: _Deep Dive 🔍, Legal

This post covers my research into open source software licensing and my analysis of real-world open source projects that profit off of open source code via proprietary licenses.

Keep reading and you’ll learn:

What the difference between a restrictive and permissive license is
What dual licensing is and how you can use it make money off of open source code
What CLAs are and the specific clause your CLA needs for use with dual licensing
Examples of companies that implement dual licensing and how they do it

And of course: I am not a lawyer and none of this is legal advice.

Let’s talk evil. And by evil, I mean money.

Continue reading →

What they don’t tell you about demand paging in school

4 Replies

October 14, 2020: _Deep Dive 🔍, C++, Favorite, Linux, Linux Kernel

This post details my adventures with the Linux virtual memory subsystem, and my discovery of a creative way to taunt the OOM (out of memory) killer by accumulating memory in the kernel, rather than in userspace.

Keep reading and you’ll learn:

Internal details of the Linux kernel’s demand paging implementation
How to exploit virtual memory to implement highly efficient sparse data structures
What page tables are and how to calculate the memory overhead incurred by them
A cute way to get killed by the OOM killer while appearing to consume very little memory (great for parties)

Note: Victor Michel wrote a great follow up to this post here.

Continue reading →

How setjmp and longjmp work (2016)

2 Replies

February 9, 2016: _Deep Dive 🔍, Assembly, C, Favorite, Linux

Pretty recently I learned about setjmp() and longjmp(). They’re a neat pair of libc functions which allow you to save your program’s current execution context and resume it at an arbitrary point in the future (with some caveats²). If you’re wondering why this is particularly useful, to quote the manpage, one of their main use cases is “…for dealing with errors and interrupts encountered in a low-level subroutine of a program.” These functions can be used for more sophisticated error handling than simple error code return values.

I was curious how these functions worked, so I decided to take a look at musl libc’s implementation for x86. First, I’ll explain their interfaces and show an example usage program. Next, since this post isn’t aimed at the assembly wizard, I’ll cover some basics of x86 and Linux calling convention to provide some required background knowledge. Lastly, I’ll walk through the source, line by line.

Continue reading →

offlinemark

Life, art, and systems programming

Category Archives: _Deep Dive 🔍

Linux Internals: How /proc/self/mem writes to unwritable memory

Introduction

Open source licensing for supervillains

What they don’t tell you about demand paging in school

How setjmp and longjmp work (2016)