Chip-8 Emulator

Team:
- Xin Gao (xg2376)
- Daniel Indictor (di2215)
- Elysia Witham (ew2632)
- Yuhang Zhu (yz4136)
FSM Diagram

- **Direct Write**
  - "Go bit" False
  - Write state data according to bus.
  - "Go bit" false && delay timer timeout
  - delayTimer = 0

- **Wait**
  - "Go bit" false && no delay timer timeout
  - delayTimer++
  - Execution finished
  - send writeReady, delayTimer++

- **Fetch/Decode**
  - "Go bit" true
  - delayTimer=0
  - Load RAM[PC], delayTimer++

- **Execute**
  - Execution not finished
  - Continue executing, delayTimer++
Code Layout
Chip-8

- **hw/**
  - Contains SystemVerilog modules (next slide)
  - Contains module-level unit tests
- **sw/**
  - Contains kernel driver (chip8.c)
  - Contains user-space application
- **swproto/**
  - Contains software prototype
  - Contains unit tests
Chip-8 Hardware

- **display_ram**: A simple 256-byte two-port read-write RAM to be read by the display module and written by either the software driver or the CPU module.
- **chip_ram**: A 4096-byte two-port read-write RAM to be read by the CPU and written by the CPU or the software driver.
- **display**: This module does the job of reading data from the display ram and scaling/fitting the 64x32 pixel display native to CHIP8 to a 640x480 pixel VGA display. It thus also handles setting VGA protocol signals.
- **cpu**: This module is incomplete (as you shall see) but in theory it should house the finite state machine given earlier and a huge switch statement to handle the opcodes.
- **sound**: This module is complete (but not integrated). In theory, it should just make the FPGA make a sound on its AUX port given the signal to do so.
- **yachip**: This is the top-level module (completed but not tested) that delegates access to the display and chip8 rams to the CPU and display module. As discussed, it gives priority to the software driver.
- **timer**: Timers are necessary in three parts in our code: first, for two special purpose registers which, when set to a non-zero value tick down to 0 at 60Hz, and another to slow down the CPU.
VGA Display

Ram module
Chip8 only had 64x32 pixel display
Interface with CPU.
● Generate a memory initialization file (.mif) for audio effect
● Single-port ROM memory blocks
● Use 3 IPs in Qsys:
  ○ Audio and Video Config
  ○ Audio Clock for DE-series Board
  ○ Audio.
We opted to use a USB keyboard rather than a dedicated pin pad for CHIP8 inputs. Many online CHIP8 emulators support a keyboard with the same keys as the ones we are using mapped to the same inputs. The mapping is as follows:

```
1 2 3 4       1 2 3 C
q w e r       4 5 6 D
a s d f  -->  7 8 9 E
z x c v        A 0 B F
```

The left side represents a standard keyboard, with the right side representing what it is being translated as for the CHIP8 device.
We created a userspace application with a few pre-selected games which can be chosen with 5-9 number keys. Keyboard presses are captured by software, with the appropriate data being communicated to hardware. Once a game is selected via the keyboard, the hardware emulator will take over and a user will be able to play the selected game using the keyboard.
Software Prototype

[Image of a maze-like diagram]

```
$ ninja state_machin_test
(4/4) Linking CXX executable state_machin_test
Running main() from /home/daniel/Documents/college/embedded_systems_cseeeu4343/ninja

Running 23 tests from 1 test suite.

Global test environment set-up.

Global test environment tear-down.

OK  | StateMachineTest.Test0BEE.2nnn
OK  | StateMachineTest.Test0BEE.2nnn (0 ms)
OK  | StateMachineTest.Test0BEE.2nnn (0 ms)
OK  | StateMachineTest.Test11nnn
OK  | StateMachineTest.Test11nnn (0 ms)
OK  | StateMachineTest.Test13kk.4xk
OK  | StateMachineTest.Test13kk.4xk (0 ms)
OK  | StateMachineTest.Test13kk.4xk (0 ms)
OK  | StateMachineTest.Test15ku.9xk
OK  | StateMachineTest.Test15ku.9xk (0 ms)
OK  | StateMachineTest.Test1Buvu8 (0 ms)
OK  | StateMachineTest.Test1Buvu8 (0 ms)
OK  | StateMachineTest.Test1Buvu8 (0 ms)
OK  | StateMachineTest/Test1Buvu8 (0 ms)
OK  | StateMachineTest/Test1Buvu8 (0 ms)
OK  | StateMachineTest/Test1Buvu8 (0 ms)
OK  | StateMachineTest/Test1Buvu8 (0 ms)
OK  | StateMachineTest/Test1Buvu8 (0 ms)
OK  | StateMachineTest/Test1Buvu8 (0 ms)
OK  | StateMachineTest/Test1Buvu8 (0 ms)
OK  | StateMachineTest/Test1Buvu8 (0 ms)
```

Expected equality of these values:

```
machine.reg[1])
Which is = B4B
0x308
Which is = 288
```

```
FAILED | StateMachineTest.Test1Buvu8 (0 ms)
RUN    | StateMachineTest.TestFxB8
```
```cpp
1 chip_ram_test.cpp
2 #include "hu_test.hpp"
3 #include "Vchip_ram.h"
4 using ChipRamTest = VTest<Vchip_ram>;
5
6 TEST_F(ChipRamTest, TestSimpleWriteRead) {
7     eval();
8     dut.wa = 0;
9     dut.wb = 1;
10    // Write mem[0x01] = 0x23; mem[0x45] = 0x67;
11    dut.aa = 0x01;
12    dut.da = 0x23;
13    dut.ab = 0x45;
14    dut.db = 0x67;
15    dut.wa = 1;
16    dut.wb = 1;
17    ticktock();
18    ASSERT_EQ(dut.qa, 0x23);
19    ASSERT_EQ(dut.qb, 0x67);
20
21    // Make sure it does nothing with write enable off on port a.
22    dut.wa = 0;
23    dut.da = 0xff;
24    // Write mem[0x89] = 0xAB;
25    dut.ab = 0x89;
26    dut.db = 0xAB;
27    ticktock();
28    ASSERT_EQ(dut.qa, 0x23);
29    ASSERT_EQ(dut.qb, 0xAB);
30
31    // Read address 0x01 with port b.
32    dut.wb = 0;
33    dut.ab = 0x01;
34    ticktock();
35    ASSERT_EQ(dut.qa, 0x23);
36    ASSERT_EQ(dut.qb, 0x23);
```
Observations & Advice

- Decision to use C++ rather than C complicated the compilation process on the FPGA itself and our ability to use libraries that we learned about in labs, such as libusb and ioctl. We wound up having to cross compile C++ and C code at the end, which certainly increased the debugging time frame. Note to future students: getting cmake to work on Linux on the FPGA is a pain!

- Writing rigorous tests for all the modules was a good decision. Completing the modules would have been a better decision still