Implementation Notes: The LC-3 Virtual Machine

April 2025

Introduction
C Function Bindings
Registers
Instruction Set
Condition Flags
Traps
Memory Mapped Registers
Memory Utilities
Helper Functions
Assembly Note
Main Loop
Instruction Implementations
- ADD
- AND
- NOT
- BR (Branch)
- JMP (Jump)
- JSR (Jump Register)
- LD (Load)
- LDI (Load Indirect)
- LDR (Load Register)
- LEA (Load Effective Address)
- ST (Store)
- STI (Store Indirect)
- STR (Store Register)
- TRAP
  - GETC
  - OUT
  - PUTS
  - IN
  - PUTSP
  - HALT

Introduction

This VM will simulate the LC-3, an educational computer architecture commonly used to teach university students computer architecture and assembly. It has a simplified instruction set compared to x86, but demonstrates the main ideas used by modern CPUs. LC-3 ISA

The LC-3 has the following features:

16 bit word size, 4 bit opcode, 12 bit param space
16 opcodes
8 general purpose registers, 1 program counter, 1 condition flag register
3 condition flags: P, Z, N
Big Endian

C Function Bindings

Our VM needs to interact with the terminal for input/output operations. We use these external C functions to handle terminal input properly:

disable_input_buffering:
- Puts the terminal in “raw” mode where key presses are immediately available
- Without this, input would only be sent after pressing Enter
- Returns an integer status code (0 for success)
restore_input_buffering:
- Restores the terminal to its normal “cooked” mode after program execution
- This ensures the terminal behaves normally after our program exits
check_key:
- Checks if a key has been pressed without blocking
- Returns true if a key is available to be read
- Allows our program to poll for keyboard input without halting execution

These functions are crucial for implementing proper keyboard handling in our VM, especially for trap routines that need to read input characters.

extern fn disable_input_buffering() c_int;
extern fn restore_input_buffering() void;
extern fn check_key() bool;

Registers

A register is a slot for storing a single value on the CPU. If we want to work with a piece of data, we need to load it into a register first. A small number of registers –> only a minimal number of values can be stored at once.

We can get around this by: 1. Loading a value from memory into a register 2. Calculating a value into another register 3. Storing the final result back into memory

The GP registers can be used to perform any program calculations. The Program Counter is the address of the next instruction in memory to execute. The Condition Flags tell us information about the previous calculation.

// Register int shortcuts
const R_COUNT = @intFromEnum(Registers.R_COUNT);
const R_PC = @intFromEnum(Registers.R_PC);
const R_COND = @intFromEnum(Registers.R_COND);
const R0 = @intFromEnum(Registers.R0);
const R1 = @intFromEnum(Registers.R1);
const R2 = @intFromEnum(Registers.R2);
const R3 = @intFromEnum(Registers.R3);
const R4 = @intFromEnum(Registers.R4);
const R5 = @intFromEnum(Registers.R5);
const R6 = @intFromEnum(Registers.R6);
const R7 = @intFromEnum(Registers.R7);

const KBSR = @intFromEnum(MemoryMappedRegisters.KBSR);
const KBDR = @intFromEnum(MemoryMappedRegisters.KBDR);

// We use a simple array to represent memory
const MEMORY_MAX = std.math.maxInt(u16);
var memory: [MEMORY_MAX]u16 = undefined;

// Registers
const Registers = enum(u16) { R0 = 0, R1, R2, R3, R4, R5, R6, R7, R_PC, R_COND, R_COUNT = 10 };

var registerStorage: [R_COUNT]u16 = undefined;

Instruction Set

An instruction is a command that tells the CPU to do some fundamental task. Instructions have both an opcode (task to perform) and parameters (inputs on the task being performed).

e.g. for the task 2 + 3:

the task is addition/+ (and the opcode is whatever the opcode is for addition)
the parameters are 2 and 3

Each opcode is one task the CPU knows how to do.

const Opcodes = enum(u16) {
    OP_BR, // BRANCH
    OP_ADD, // ADD
    OP_LD, // LOAD
    OP_ST, // STORE
    OP_JSR, // JUMP REGISTER
    OP_AND, // BITWISE AND
    OP_LDR, // LOAD REGISTER
    OP_STR, // STORE REGSITER
    OP_RTI, // UNUSED
    OP_NOT, // BITWISE NOT
    OP_LDI, // LOAD INDIRECT
    OP_STI, // STORE INDIRECT
    OP_JMP, // JUMP
    OP_RES, // RESERVED (UNUSED)
    OP_LEA, // LOAD EFFECTIVE ADDRESS
    OP_TRAP, // EXECUTE TRAP
};

Condition Flags

Condition flags provide information on the most recently executed calculation. Also allows us to check logical conditions. Each flag indicates the the sign of the previous calculation.

const conditionFlags = enum(u16) {
    FL_POS = (1 << 0), // (P)ositive
    FL_ZRO = (1 << 1), // (Z)ero
    FL_NEG = (1 << 2), // (N)egative
};

Traps

Trap routines are routines for getting input from the keyboard and displaying strings to the console. You can almost imagine these as an API for the LC-3. Each trap routine is assigned a trap code which operates like an opcode. When a trap routine is executed, we move the program counter to the routine’s address and execute the procedure’s instructions, then move the PC back to where it was before the routine. Trap routines technically don’t add any new functionality to our system, they just provide a convenient way to perform tasks.

const TrapCodes = enum(u16) {
    GETC = 0x20, // Get char from keyboard, not echoed onto terminal
    OUT = 0x21, // Output a char
    PUTS = 0x22, // Output a word string
    IN = 0x23, // Get char from keyboard, echoed onto terminal
    PUTSP = 0x24, // Output a byte string
    HALT = 0x25, // Halt the program
};

Memory Mapped Registers

These are special registers that are inaccessible from the normal register table. Instead we reserve a special address for them in memory and read and rite only to their memory location. We mostly use these to interact with special hardware devices (such as a keyboard).

LC-3 has 2 MMRs we need to implement:

KBSR: Keyboard status register, indicates whether a key has been pressed
KBDR: Keyboard sata register, idicates which key was pressed

We use these because they are non-blocking, they allow us to poll the state of the keyboard and continue execution.

The other (RE: simpler) way to do this is to use the GETC trap code, but this block execution

const MemoryMappedRegisters = enum(u16) {
    KBSR = 0xFE00, // Keyboard status
    KBDR = 0xFE02, // Keyboard data
};

Memory Utilities

These are helper functions for reading and writing to memory. We use these because we need to handle the special case of the keyboard status register. When we read from the KBSR, we need to check if a key has been pressed. If so, we need to read the key from the keyboard and store it in the KBDR. We also need to update the KBSR to reflect that a key has been pressed. This is a non-blocking operation, so we can continue execution without blocking.

fn memRead(addr: u16) !u16 {
    if (addr == KBSR) {
        if (check_key()) {
            memory[KBSR] = (1 << 15);
            memory[KBDR] = std.io.getStdIn().reader().readInt(u16, .big) catch {
                return error.InputError;
            };
        } else {
            memory[KBSR] = 0;
        }
    }
    return memory[addr];
}

fn memWrite(address: u16, val: u16) void {
    // Check if address is valid
    if (address == MEMORY_MAX) {
        std.debug.print("Warning: Attempted to write to max memory address 0x{x}\n", .{address});
        return;
    }
    memory[address] = val;
}

Helper Functions

The VM uses several helper functions to support its operation:

swap16

Swaps the byte order of a 16-bit number (endianness conversion)

Input: 16-bit integer
Output: Same integer with bytes swapped
Used when loading program images that may have different endianness

fn swap16(x: u16) u16 {
    return (x << 8) | (x >> 8);
}

handleCLArgs

Processes command line arguments to control VM behavior

Handles -h/--help flag to display usage info
Handles -i/--image flag to load program images
Reads binary program file and loads it into VM memory
Performs endianness conversion on loaded instructions

fn handleCLArgs(args: *std.process.ArgIterator, allocator: std.mem.Allocator) !void {
    while (args.next()) |arg| {
        if (eql(u8, arg, "-h") or eql(u8, arg, "--help")) {
            const help =
                \\Usage: vm [options]
                \\ Options:
                \\     -h, --help  Display this help message
                \\     -i, --image  Load a program image
            ;

            std.debug.print(help, .{});
            std.process.exit(0);
        } else if (eql(u8, arg, "-i") or eql(u8, arg, "--image")) {
            // Check if image path is provided
            const image_path = args.next() orelse {
                std.log.err("Error: No image path provided\n", .{});
                return error.NoImagePath;
            };

            // Open the file
            const file = try std.fs.cwd().openFile(image_path, .{});
            defer file.close();

            // Read origin (first 2 bytes)
            var origin_bytes: [2]u8 = undefined;
            _ = try file.read(&origin_bytes);
            const origin: u16 = std.mem.readInt(u16, &origin_bytes, .big);

            // Calculate maximum bytes we can read
            // const max_read = MEMORY_MAX - origin;
            const file_size = (try file.stat()).size;

            // Read the rest of the file
            var buffer = try allocator.alloc(u8, file_size - 2);
            defer allocator.free(buffer);
            _ = try file.read(buffer);

            // Copy to memory and swap endianness
            // TODO: figure out how to do this without a loop
            var i: usize = 0;
            while (i < buffer.len - 1) : (i += 2) {
                const value = std.mem.readInt(u16, buffer[i..][0..2], .big);
                memory[origin + (i / 2)] = value;
            }
        } else {
            const help =
                \\Usage: vm [options]
                \\ Options:
                \\     -h, --help  Display this help message
                \\     -i, --image  Load a program image
            ;

            std.debug.print(help, .{});
            std.process.exit(1);
        }
    }
}

signExtend

Sign extends an N-bit number to 16 bits

Used to preserve negative numbers when expanding bit width
Takes original value and bit count
Sets upper bits based on sign bit of original value

NOTE: Realistically this could just be a stdlib function

fn signExtend(x: u16, comptime bitCount: u4) u16 {
    if (((x >> (bitCount - 1)) & 1) == 1) {
        return x | (@as(u16, 0xFFFF) << bitCount);
    }
    return x;
}

updateFlags

Updates condition flags based on a register value

Sets appropriate flag (NEG/ZERO/POS) based on register contents
Ensures exactly one condition flag is set at all times
Used after operations that modify registers

fn updateFlags(r: u16) void {
    if (registerStorage[r] == 0) {
        registerStorage[R_COND] = @intFromEnum(conditionFlags.FL_ZRO);
    } else if ((registerStorage[r] >> 15) == 1) {
        registerStorage[R_COND] = @intFromEnum(conditionFlags.FL_NEG);
    } else {
        registerStorage[R_COND] = @intFromEnum(conditionFlags.FL_POS);
    }
}

Assembly Note

At the base of our VM we want to transform assembly (very low level human readable code) into 16-bit binary instructions the vm can understand. This transformer is called an assembler, and the resultant instructions are called machine code.

NOTE: An assembler and a compiler are not the same thing! An assembler simply encodes what the programmer has written in text into binary, replacing symbols with their binary representation and packing them into instructions.

Main Loop

The main loop of our VM is quite simple:

Load an instruction from memory at the address in the PC register
Increment the PC register
Look at the opcode to detemine which type of instruction it should perform
Perform the instruction using the params in the instruction

Repeat!

pub fn main() !void {
    var gpa = std.heap.GeneralPurposeAllocator(.{}){};
    defer {
        const deinit_status = gpa.deinit();
        if (deinit_status == .leak) std.testing.expect(false) catch @panic("TEST FAIL");
    }

    var args = try std.process.argsWithAllocator(gpa.allocator());
    defer args.deinit();

    _ = args.skip();

    try handleCLArgs(&args, gpa.allocator());

    _ = disable_input_buffering();
    defer restore_input_buffering();

    // We need to have 1 condition flag set at any given time, so we set the Z flag
    registerStorage[R_COND] = @intFromEnum(conditionFlags.FL_ZRO);

    // Set the PC to the starting position
    const PC_START = 0x3000;
    registerStorage[R_PC] = PC_START;

    // TODO for some reason this needs a try? figure out why
    try cpu: while (true) {
        // fetch instruction
        const instruct: u16 = try memRead(registerStorage[R_PC]);
        const op: Opcodes = @enumFromInt(instruct >> 12);

        // Fix: Use wrapping addition to prevent overflow
        registerStorage[R_PC] = @addWithOverflow(registerStorage[R_PC], 1)[0];

        // Switch on the opcode
        switch (op) {
            // ... instruction implementations ...
        }
    };
}

Instruction Implementations

ADD

ADD takes two values, adds them, and stores them in a register. The encoding for ADD instructions generally goes (from left to right):

opcode value (4 bits, 0001 for ADD)
destination register (3 bits)
SR1, or the first number to add (3 bits)
mode flag (1 bit)

After this it changes…

The ADD instruction has two different modes that can alter the encoding of the instruction:

Register mode: The second number is stored in a register, just like the first
- Uses 3 bits (bits 0-2, bits 3-4 are unused)
- In assembly this would look like this: ADD R2 R0 R1 ; add the contents of R0 to R1 and store in R2.
Immediate mode: The second number is stored directly in the instruction
- Uses 5 bits (bits 0-4)
- This removes the need to load the value from memory, but also limits the size of the second value (max 2^5 = 32)
- Due to this limitation, immediate mode is primarily useful for incrementing instructions
- In assembly this would look like this: ADD R2 R0 R1 ; add the contents of R0 to R1 and store in R2.
- In addition, to add a 5-bit number to a 16-bit one, we need to extend the 5-bit number to 16-bits. This process is called sign-extending
  - For positive numbers we can fill in 0s, for negative numbers we can fill in 1s

Finally, regardless of mode, we have to update the condition flags of a value to indicate its sign ANY time we write to a register

.OP_ADD => {
    // Get destination register
    const r0: u16 = (instruct >> 9) & 0x7;

    // Get first param
    const r1: u16 = (instruct >> 6) & 0x7;

    // Check for immediate mode
    const immFlag: u16 = (instruct >> 5) & 0x1;

    if (immFlag == 1) {
        const imm5: u16 = signExtend(instruct & 0x1F, 5);
        // Fix: Use wrapping addition to prevent overflow
        registerStorage[r0] = @addWithOverflow(registerStorage[r1], imm5)[0];
    } else {
        const r2: u16 = instruct & 0x7;
        // Fix: Use wrapping addition to prevent overflow
        registerStorage[r0] = @addWithOverflow(registerStorage[r1], registerStorage[r2])[0];
    }

    updateFlags(r0);
},

AND

AND takes two values and does a bitwise calculation, where if the bits of the param match a resultant 1 is produced, otherwise a 0. The resulting complete value is then stored in the destination register. AND uses immediate and register modes like ADD, and has the exact same encoding.

.OP_AND => {
    // Get destination register
    const r0: u16 = (instruct >> 9) & 0x7;

    // Get first param
    const r1: u16 = (instruct >> 6) & 0x7;

    // Check for immediate mode
    const immFlag: u16 = (instruct >> 5) & 0x1;

    if (immFlag == 1) {
        const imm5: u16 = signExtend(instruct & 0x1F, 5);
        registerStorage[r0] = registerStorage[r1] & imm5;
    } else {
        const r2: u16 = instruct & 0x7;
        registerStorage[r0] = registerStorage[r1] & registerStorage[r2];
    }

    updateFlags(r0);
},

NOT

NOT returns the bitwise complement of the param given. Encoding:

Opcode (4 bits, should be 1001)
Destination register (3 bits)
SR1, the register of our only param (3 bits)
The bits [5:0] are filled with 1s and are unused

.OP_NOT => {
    // Get the destination register
    const r0: u16 = (instruct >> 9) & 0x7;

    // Get param register to flip
    const r1: u16 = (instruct >> 6) & 0x7;

    registerStorage[r0] = ~registerStorage[r1];

    updateFlags(r0);
},

BR (Branch)

If the condition flag in the flag register matches the condition flag in the encoding, jump to the address (PC + offset) and execute that instruction. Encoding:

Opcode (4 bits, should be 0000)
Condition flags (3 bits) - n, z, p flags that determine when to branch
PCoffset9 (9 bits) - Signed offset from the current PC

.OP_BR => {
    // Get program counter offset
    const pcOffset: u16 = signExtend(instruct & 0x1FF, 9);

    // Get condition flag
    const condFlag: u16 = (instruct >> 9) & 0x7;

    if ((condFlag & registerStorage[R_COND]) != 0) {
        // Fix: Use wrapping addition to prevent overflow
        registerStorage[R_PC] = @addWithOverflow(registerStorage[R_PC], pcOffset)[0];
    }
},

JMP (Jump)

Unconditionally jumps to the address stored in the base register. Encoding:

Opcode (4 bits, should be 1100)
Unused (3 bits)
Base register (3 bits) - Contains the address to jump to
Unused (6 bits)

.OP_JMP => {
    // Get destination register
    const r1: u16 = (instruct >> 6) & 0x7;

    registerStorage[R_PC] = registerStorage[r1];
},

JSR (Jump Register)

Jumps to a location in memory and saves return address. Encoding:

opcode (4 bits, should be 0100)
long flag (1 bit) - Determines if using JSR (1) or JSRR (0) mode
If JSR: PCoffset11 (11 bits) - Signed offset from current PC
If JSRR: unused (2 bits), BaseR (3 bits), unused (6 bits)

Always saves the incremented PC in R7 before jumping.

.OP_JSR => {
    // Check whether we're in long or short mode
    const longFlag = (instruct >> 11) & 1;

    // Save the address of the call routine in register 7
    registerStorage[R7] = registerStorage[R_PC];

    if (longFlag == 1) {
        // JSR case
        const longPcOffset: u16 = signExtend(instruct & 0x1FF, 11);
        // Fix: Use wrapping addition to prevent overflow
        registerStorage[R_PC] = @addWithOverflow(registerStorage[R_PC], longPcOffset)[0];
    } else {
        // JSRR case
        const r1: u16 = (instruct >> 6) & 0x7;
        // Fix: Use wrapping addition to prevent overflow
        registerStorage[R_PC] = @addWithOverflow(registerStorage[R_PC], registerStorage[r1])[0];
    }
},

LD (Load)

Loads a value from a location in memory into a register. Encoding:

opcode (4 bits, should be 0010)
destination register (3 bits)
PCoffset9 (9 bits) - Signed offset added to the current PC to get memory address

.OP_LD => {
    // Get destination register
    const r0: u16 = (instruct >> 9) & 0x7;

    // Get PC Offset
    const pcOffset: u16 = signExtend(instruct & 0x1FF, 9);

    // Fix: Use wrapping addition to prevent overflow
    const addr = @addWithOverflow(registerStorage[R_PC], pcOffset)[0];
    registerStorage[r0] = try memRead(addr);

    updateFlags(r0);
},

LDI (Load Indirect)

Loads a value from a location in memory into a register. Encoding:

opcode (4 bits, should be 0101)
destination register (3 bits)
PCoffset9 (9 bits) - Points to address containing final target address

We also need to sign-extend the PCoffset to make it a valid memory address. This may seem roundabout, considering the LD instruction exists, but it is incredibly useful. LD is limited to offsets that are 9 bits, but our memory needs 16 bit addresses. LDI is great for loading data that’s stored far away from the current PC, but to use it the address of the final location needs to be stored in a neighborhood nearby. It’s similar in concept to having a local variable that’s a pointer to some address.

.OP_LDI => {
    // Get destination register
    const r0: u16 = (instruct >> 9) & 0x7;

    // Get PCoffset 9
    const pcOffset: u16 = signExtend(instruct & 0x1FF, 9);

    // Fix: Use wrapping addition to prevent overflow
    const addr = @addWithOverflow(registerStorage[R_PC], pcOffset)[0];
    // Add PCoffset to the current PC, look into that mem location to get the final address
    registerStorage[r0] = try memRead(try memRead(addr));

    // Update flags
    updateFlags(r0);
},

LDR (Load Register)

Loads a value from a location in memory into a register. Encoding:

opcode (4 bits, should be 0110)
destination register (3 bits)
base register (3 bits)
offset6 (6 bits) - Signed offset added to base register value

The offset is added to the base register to get the final address to load from. This is useful for loading data that’s stored in a neighborhood of the base register.

.OP_LDR => {
    // Get destination register
    const r0: u16 = (instruct >> 9) & 0x7;

    // Get base register
    const r1: u16 = (instruct >> 6) & 0x7;

    // Get offset
    const offset: u16 = signExtend(instruct & 0x3F, 6);

    // Fix: Use wrapping addition to prevent overflow
    const addr = @addWithOverflow(registerStorage[r1], offset)[0];
    registerStorage[r0] = try memRead(addr);

    // Update flags
    updateFlags(r0);
},

LEA (Load Effective Address)

Loads the address of a memory location into a register. Encoding:

opcode (4 bits, should be 1110)
destination register (3 bits)
PCoffset9 (9 bits) - Signed offset from current PC

Unlike other loads, this loads the address itself rather than the value at that address.

.OP_LEA => {
    // Get destination register
    const r0: u16 = (instruct >> 9) & 0x7;

    // Get sign extended offset
    const pcOffset: u16 = signExtend(instruct & 0x1FF, 9);

    // Fix: Use wrapping addition to prevent overflow
    registerStorage[r0] = @addWithOverflow(registerStorage[R_PC], pcOffset)[0];

    updateFlags(r0);
},

ST (Store)

Stores a register value into memory. Encoding:

opcode (4 bits, should be 0011)
source register (3 bits) - Register containing value to store
PCoffset9 (9 bits) - Signed offset from current PC for target address

.OP_ST => {
    // Get storage register
    const sr: u16 = (instruct >> 9) & 0x7;

    // Get PC offset
    const pcOffset: u16 = signExtend(instruct & 0x1FF, 9);

    // Fix: Use wrapping addition to prevent overflow
    const addr = @addWithOverflow(registerStorage[R_PC], pcOffset)[0];
    memWrite(addr, registerStorage[sr]);
},

STI (Store Indirect)

Stores a register value into memory using indirect addressing. Encoding:

opcode (4 bits, should be 1011)
source register (3 bits) - Register containing value to store
PCoffset9 (9 bits) - Points to address containing final target address

Similar to LDI, allows storing to addresses further than 9-bit offset would allow.

.OP_STI => {
    // Get storage register
    const sr: u16 = (instruct >> 9) & 0x7;

    // Get PC offset
    const pcOffset: u16 = signExtend(instruct & 0x1FF, 9);

    // Fix: Use wrapping addition to prevent overflow
    const addr = @addWithOverflow(registerStorage[R_PC], pcOffset)[0];
    memWrite(try memRead(addr), registerStorage[sr]);
},

STR (Store Register)

Stores a register value into memory using base+offset addressing. Encoding:

opcode (4 bits, should be 0111)
source register (3 bits) - Register containing value to store
base register (3 bits) - Contains base address
offset6 (6 bits) - Signed offset added to base register value

.OP_STR => {
    // Get storage register
    const sr: u16 = (instruct >> 9) & 0x7;

    // Get base register
    const br: u16 = (instruct >> 6) & 0x7;

    // Get memory offset
    const offset: u16 = signExtend(instruct & 0x3F, 6);

    // Fix: Use wrapping addition to prevent overflow
    const addr = @addWithOverflow(registerStorage[br], offset)[0];
    memWrite(addr, registerStorage[sr]);
},

TRAP

System call that performs various I/O and control operations. Encoding:

opcode (4 bits, should be 1111)
unused (4 bits)
trapvect8 (8 bits) - Index into trap vector table

.OP_TRAP => {
    // Store current PC in a register
    registerStorage[R7] = registerStorage[R_PC];

    // Get the trap code
    const trapCode: TrapCodes = @enumFromInt(instruct & 0xFF);

    // Switch on the trap code
    switch (trapCode) {
        // ... trap implementations ...
    }
},

GETC

GETC (0x20)

Reads a single character from the keyboard.
The character is not echoed to the console.
The ASCII code for the character is stored in R0.

.GETC => {
    const stdin = std.io.getStdIn().reader();
    const c = stdin.readByte() catch {
        break error.InputError;
    };
    registerStorage[R0] = c;
    updateFlags(R0);
},

OUT

OUT (0x21)

Writes a single character to the console.
The character is taken from R0 (only the least significant 8 bits are used).

.OUT => {
    const c: u8 = @truncate(registerStorage[R0]);
    std.io.getStdOut().writer().writeByte(c) catch {
        break error.OutputError;
    };
},

PUTS

PUTS (0x22)

Writes a string of ASCII characters to the console.
The characters are taken from consecutive memory locations, one character per memory location,
starting with the address in R0 and continuing until a null terminator (0x0000) is encountered.

.PUTS => {
    var address = registerStorage[R0];
    var char: u16 = try memRead(address);
    while (char != 0) {
        const c: u8 = @truncate(char);
        try std.io.getStdOut().writer().writeByte(c);
        // Fix: Use wrapping addition to prevent overflow
        address = @addWithOverflow(address, 1)[0];
        char = try memRead(address);
    }
},

IN

IN (0x23)

Prints a prompt on the screen and reads a single character from the keyboard.
The character is echoed to the console, and its ASCII code is stored in R0.

.IN => {
    const stdout = std.io.getStdOut().writer();
    const stdin = std.io.getStdIn().reader();

    try stdout.writeAll("Enter a character: ");

    // Read single character
    const c = stdin.readByte() catch {
        break error.InputError;
    };

    // Echo the character
    try stdout.writeByte(c);
    try stdout.writeByte('\n');

    // Store in R0 and update flags
    registerStorage[R0] = c;
    updateFlags(R0);
},

PUTSP

PUTSP (0x24)

Writes a string of ASCII characters to the console.
The characters are stored in consecutive memory locations, two characters per memory location,
starting with the address in R0 and continuing until a null terminator (0x0000) is encountered.
The low byte of each memory location is the first character, and the high byte is the second.

.PUTSP => {
    const stdout = std.io.getStdOut().writer();
    var address = registerStorage[R0];
    var value: u16 = try memRead(address);

    while (value != 0) {
        const char1: u8 = @truncate(value & 0xFF);
        stdout.writeByte(char1) catch return;

        const char2: u8 = @truncate(value >> 8);
        if (char2 != 0) {
            stdout.writeByte(char2) catch return;
        }

        // Fix: Use wrapping addition to prevent overflow
        address = @addWithOverflow(address, 1)[0];
        value = try memRead(address);
    }
},

HALT

HALT (0x25)

Halts execution of the program and displays a message on the console.
This is equivalent to executing a BREAK instruction in the simulator.

.HALT => {
    std.log.err("HALT", .{});
    break :cpu;
},

Resources

For those interested in exploring this project further:

GitHub - The complete source code for this LC-3 virtual machine implementation
Guide - An excellent tutorial by Justin Meiners and Ryan Pendleton that served as a reference for this implementation

Jordan Streete

Implementation Notes: The LC-3 Virtual Machine

Table of Contents

Introduction

C Function Bindings

Registers

Instruction Set

Condition Flags

Traps

Memory Mapped Registers

Memory Utilities

Helper Functions

swap16

handleCLArgs

signExtend

updateFlags

Assembly Note

Main Loop

Instruction Implementations

ADD

AND

NOT

BR (Branch)

JMP (Jump)

JSR (Jump Register)

LD (Load)

LDI (Load Indirect)

LDR (Load Register)

LEA (Load Effective Address)

ST (Store)

STI (Store Indirect)

STR (Store Register)

TRAP

GETC

OUT

PUTS

IN

PUTSP

HALT

Resources