Guide: Bare-Metal Programming using gcc


Written by Pat Hanrahan and Julie Zelenski

This guide gives a brief overview of what is unique about compiling C programs to execute in a bare-metal environment.

Hosted versus non-hosted environments

A typical program is compiled for a hosted system where it has access to the standard libraries and facilities provided by the operating system layer. In hosted mode, the program runs at the pleasure of the host operating system. In contrast, a bare-metal program is non-hosted; it does not stand on top of an operating system or library; it runs entirely on its own. The program has the freedom to do whatever it wants, without any pesky interference from an OS overlord, but cannot count on any facilities other than what it provides for itself.

A non-hosted program runs on "bare-metal". In compiler-speak, the bare-metal environment is referred to as standalone or freestanding.

By default, gcc compiles assuming a hosted environment, since this is the common case. To properly compile a bare metal program, we need to set the appropriate compiler and linker options to ensure the program is configured to run standalone.

Compiler option -ffreestanding

This gcc option directs the compiler to limit this program to only those features available in the freestanding environment.

$ riscv64-unknown-elf-gcc -ffreestanding -c blink.c

In freestanding mode, the only available standard header files are: <float.h>, <iso646.h>, <limits.h>, <stdarg.h>, <stdbool.h>, <stddef.h>, and <stdint.h> (C99 standard 4.6). These headers define the types appropriate for the machine being used, as well as basic constants such as the minimum and maximum values for different types. None of the other standard header files (<stdio.h>, <string.h> and so on) are available; these headers cannot be included or used.

In hosted mode, the main function must adhere to a rigid specification. Execution begins at the function named main and its signature must typically match:

int main(int argv, char *argv[])   // main in hosted env

The compiler will issue warnings if you define main differently for a hosted program.

Freestanding mode removes the special semantics for the main function. In the standalone world, main can have any type signature and it is configurable whether it is main or some other function that starts the program. A typical main signature for a freestanding program is simply:

void main(void)                                // main in bare metal env

The -ffreestanding option also directs the compiler to not assume that standard functions have their usual definitions. This prevents the compiler from making optimizations based on assumptions about the behaviors of the standard libraries. For example, in a hosted environment,gcc is assured that the available library meets the specification of the language standard. It can transform printf("hi\n") into puts("hi") because it knows from the definition of the standard IO library that these two functions behave equivalently in this case. In freestanding mode, if there are functions puts and printf available, they must be your own custom versions and will behave in whatever way you chose, and the compiler will not make any assumptions about what substitutions are appropriate. When -ffreestanding is used, gcc does not assume a standard library environment and will not make such transformations.

It maybe a bit surprising to learn that even when compiling in freestanding mode, gcc can emit a call to the standard library function memcpy or memset. It uses these routines to block-copy a large-ish chunk of data, such as when initializing an array or struct or passing a struct in or out of a function. As a freestanding program does not have access to the standard library, it must supply its own implementations of these functions.

Linker option -nostdlib

The linker option -nostdlib is used to link a program intended to run standalone. -nostdlib implies the individual options -nodefaultlibs and -nostartfiles. Below we discuss the two options separately, but the most typical use is just nostdlib for one-stop shopping.

-nodefaultlibs

When linking a hosted program, standard system libraries such as libc are linked by default, giving the program access to all standard functions (printf, strlen and friends). The linker option -nodefaultlibs disables linking with those default libraries; the only libraries linked are exactly those that you explicitly name to the linker using the -l flag.

libgcc.a is a standard library (linked by default, excluded by -nodefaultlibs) that provides compiler support routines that are needed by the compiler and software subroutines that substitute/augment what is available in hardware. For example, a RISC-V processor may not have hardware support for floating point. In such cases, linking with libgcc.a provides functions that emulate the floating point operations in software. A program that attempts to use software floating point and is linked -nodefaultlibs will fail to link. The linker errors will be something akin to

riscv64-unknown-elf-ld: main.c:9: undefined reference to `__floatsidf'
riscv64-unknown-elf-ld: main.c:10: undefined reference to `__adddf3'
riscv64-unknown-elf-ld: main.c:10: undefined reference to `__divdf3'

Linking with libgcc.a (-lgcc) will provide the missing routines and resolve these references.

Note that libgcc does not supply memcpy and related functions. Buried deep in https://gcc.gnu.org/onlinedocs/gcc/Standards.html, there is a small callout that notes this:

Most of the compiler support routines used by GCC are present in libgcc, but there are a few exceptions. GCC requires the freestanding environment provide memcpy, memmove, memset and memcmp.

If your program requires one of these routines, you will need to supply it yourself.

-nostartfiles

Normally, when a program begins to run, the standard start function is called. This function sets up the machine to run the program. A common task performed by start is to configure the program segments (stack, global data, heap) and call the main function.

The option -nostartfiles instructs the linker to not use the standard system startup functions nor link the code containing those functions.

If you don't link to a start function, program state may not be properly initialized. You will need to provide your own start function when running in standalone mode.