Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/rvantonder/lambda-gbc
https://github.com/rvantonder/lambda-gbc
Last synced: 22 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/rvantonder/lambda-gbc
- Owner: rvantonder
- Created: 2016-07-29T18:22:26.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2018-05-25T02:28:16.000Z (over 6 years ago)
- Last Synced: 2024-12-09T05:36:32.627Z (27 days ago)
- Language: OCaml
- Size: 250 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# IMPORTANT!!!
To RENDER with `step_frame`:
* Zoom out FIRST, all the way, and zoom in ONCE.
* ./driver.native --bootrom --speed 0.0
* THEN RESUME
* `step_frame` on empty requests in loop for FASTTo RENDER ON DEMAND wtih `step_insn`:
* Zoom out FIRST, all the way, and zoom in ONCE.
* `./driver.native --bootrom --nr --speed 0.0` ; use no-render, much faster execution
* `(bp 0xf9)`
* resume
* wait
* render## What is going on with speed?
Need to add a value permanently to memory if failed and found in image.
Left off:
Run
`LWT_LOG="clock -> debug" ./driver.native --bootrom --speed 0.0` for testing render version
`LWT_LOG="clock -> debug" ./driver.native --bootrom --speed 0.0 --nr` for no render- ~.148 with rendering on, gpu on, no logging
- ~.057 with rendering off, gpu on, no logging
- ~.025 with rendering off, gpu off, no logging
Breakdown of slow down:
- INTERPRETER only (comment out gpu): .0312
- means GPU : adds .057-.0312 = .025 abouts
- RENDERING: adds .09+
~3/4ths of slow down comes from draw_bg and lambda-term rendering. TODO: check if from_tile_list is a big culprit. I suspect the actual lambda term rendering is not so slow, because visually the blocks don't seem to delay too much (and we render each pixel in turn).
~1/4th comes from get_tiles
Possibilities:
- optimize rendering
- even interpreter is still twice as slow as our target time (without sprites)
- optimize gpu. this happens every instruction, critical.
Ideas:
- even faster array memory
- gpu ??## Design
Double synchrony. 1) Clock says emulation may continue after enough time has elapsed (60 hz). 2) drawing must happen after every 70224 instructions, regardless of what clock says.
## Debug labels
```
interrupt
render
gpu
cycles
ev_cpu_add_bp
ev_cpu_rq_snd
ev_cpu_bp_trigger
ev_cpu_dbg_eval
ev_cpu_dbg_pc_undef
ev_dbg_rq_rcv
ev_dbg_rq_rcv_in_blking
ev_dbg_rq_rcv_in_non_blking
```# Debug settings for screen:
`LWT_LOG="render -> debug; clock -> debug" ./driver.native --bootrom --nr --speed 0.0`
## TODO
Use `ctxt#update` to bind mem variable to a new bil result (storage)
Refactor memory writing to be like `wwrite_word` in interpreter.Even better: use `Stmt.eval` with context like in sync function.
be careful of on_success use in screen.
### GPU:
problem, we need to update scanline in memory (FF44). But the only interface to write to storage is through interpreter `store`. We cannot write to storage from ctxt. We can load, and we can call save, but calling save returns a storage which we cannot update context with.
It looks like base class exp contains `create_storage`.
-- we must initialize ff44 (can't be bot), because we expect a value for gpu to work. do we initialize it in boot, or do we
initialize it when we encounter bot in gpu.ml? right now, i initialize in boot### LCD
I need to write the scanline to 0xFF44 (using GPU module), otherwise there's never any 'waiting' before rendering.
http://www.codeslinger.co.uk/pages/projects/gameboy/lcd.html
I also need to implement interrupts properly or some games won't work
http://www.codeslinger.co.uk/pages/projects/gameboy/interupts.html
the vblank interrupt, when implementing correctly, should trigger 60 times a second.
Sometimes rendering logic code says:
"I won't continue until you've told me that you rendered the last scan line"
The GPU is running all the time and renders every 70244 seconds. But sometimes the
game is waiting on it... to set the last scan line properly.### TODO: decoder should have a decode function (mli) without needing continuation
--------------------------------------
### Interactive debugger: separate window with reg updates like GDB -- use widges/OO part of lterm
### RRender interactive -- works.
### Remote debug over stdin/network. May fix screen shit. -- use lterm and matrix w/ stdin/stdout?
### rewrite pattern matching to use this opcode knowledge: http://www.z80.info/decoding.htm--------------------------------------
How I could debug with no$gmb:
Window -> IOMap shows clock countdown to vblank and clock count up with trace.
Two problems: count up only shows `>64K` at some point AND I can't find the value in memory with art money to reset it to 0. F'ed.
But! I can track the countdown vblank clock with artmoney. I can also set it to like, 1000000. Problem: IOMap window then
displays `>64k`. No matter! Artmoney still shows me the value as its decrementing. Now I can count clocks. Yes.How to track it with art money? Hit search and start with the number. Then interact with program. Then filter for the number
you expect next.--------------------------------------
possible optimization:
make `rrender` work every 10k steps
# Left off:
No debug:
./driver.native --bootrom --speed 0.0Renders with `step_frame` in non_blocking mode (let rec loop function).
Need to make it render at the right time. Problem is: while rendering,
execution continues, and i can't see the screen moving. any attempt
to slow down or pause execution stops the rendering too: it's in the
same thread.Solution: correctly model gpu and rendering. Rendering depends on every clock
http://imrannazar.com/GameBoy-Emulation-in-JavaScript:-GPU-Timings
http://www.codeslinger.co.uk/pages/projects/gameboy/lcd.html "ctrl-f sync" "ctrl-f subtract"
Emulator::Update is 60 times every second. 69905 insn per 1/60th a second are executed, and then the screen is rendered. http://www.codeslinger.co.uk/pages/projects/gameboy/beginning.html
The 69905 number comes from here: http://www.codeslinger.co.uk/pages/projects/gameboy/beginning.html
Pandocs says 70244
```
Mode 0 is present between 201-207 clks, 2 about 77-83 clks, and 3 about 169-175 clks. A complete cycle through these states takes 456 clks. VBlank lasts 4560 clks. A complete screen refresh occurs every 70224 clks.)
```http://bgb.bircd.org/pandocs.htm#lcdpositionandscrolling
At this speed, the screen is rendered 60 times a second. The gpu clock is relative to this number.
LWT_LOG="debug" ./driver.native --bootrom --speed 0.0
## Hacks
a FRAME step is a RENDERING optimization. if we do `./driver.native --botrom --speed 0.0`, things
will work, but we don't want to render every time.Hack for now: render every 10k steps. For some reason the boot rom is only about 47k steps,
so the 69k steps isn't getting hit. we will sync this up with the step_frame once
the gpu works correctly.## Debugger commands:
`(bp 0x86)`
`(print insn)`
`(print regs)`
`(print (mem 0x0))`
`(step insn)`
`(step frame)`
`render`
`pause`
`resume`
`help`
Return: repeat last command.
## Debug SCY, cool example
```
(bp 0x86)
resume
(print (mem 0xFF42))
-> 64
resume
(print (mem 0xFF42))
-> 63
```## Small fixes for one day
use bap word for breakpoints/pc. not int (see debugger_types.mli)
debugger should nto have 'dump' methods. should return the value
and i should make a separate 'printer' class.## blah
by default, the interpreter continues with `next_insn` and not `next_frame`,
because with `next_frame`, on resume, the bp gets skipped (even though, in the
ideal implementation, it shouldn't)In other words, this should work, but doesnt:
```
(bp 0x4)
(step frame)
-> BP triggered!, entering blocking mode (already in blocking mode)
```Instead it just continues.
Whenever (step frame) is called we will batch-process an entire frame before
listening to any event again. That's why. Even though the bp triggered event is sent,
we are not aborting step frame because it was triggered. correct behavior would do this.To get this, we would need to listen for evnts after each insn step inside a
frame. Not going to bother. frames are there so we can execute big pieces with a
given syncrhony. only.## Logging
Only logging under section "ev_*" will be printed, and those that map to debug.
`"ev_* -> debug"`
`LWT_LOG="ev_* -> debug" ./driver.native --bootrom --speed 0.0`
All debug messages:
`LWT_LOG="debug" ./driver.native --bootrom --speed 0.0`
same as
`LWT_LOG="* -> debug" ./driver.native --bootrom --speed 0.0`
## The bug
Bil can be nested:
```
Sep 18 22:14:43: ev_int_dbg_eval: {
Sep 18 22:14:43: ev_int_dbg_eval: if (~FZ) {
Sep 18 22:14:43: ev_int_dbg_eval: jmp (0xC:16 + (extend:16[0xFB:8]))
Sep 18 22:14:43: ev_int_dbg_eval: }
Sep 18 22:14:43: ev_int_dbg_eval: }
```interpreter #eval is called recursively on structures like the above. First evaluates if, then jmp. So, when
we have something that sends events like 'pause', it may send it multiple times before returning.this could be fixd by putting it in 'step_insn', which is at least atomic.
But the bigger issue is, should we be using mailbox already, or the queue for lwt?
because the second part of this bug is that two events are added, and then we enter infniloop
## Design
```
driver.ml # initialize image/parse bootrom options
|
runner.ml # start the interpreter loop and debugger
|________________
/ \
input_loop interpreter_loop
| |
| ` processes all debug actions will enter blocking lwt mode when debug stepping.
parse send commands to interpreter recv stream
```## Boot
`DMG_ROM.bin` is the original boot rom that would get loaded at `0x0000`, run
the Nintendo logo check routine, and set PC to `0x100` to start running the
game cartridge. Note that the bootrom only sets up the initial state. Once it
is set up, the values in `0x0000-0x100` are flushed out and replaced by the
cartridge.`DMG_ROM.bin` has the boot sequence, but not the logo data. Setting PC to 0 and
executing will cause the ROM to lock up here: `JR NZ,$fe; $00e9;if not a match,
lock up here`. That's because the boot rom is separate from the cartridge, and
it compares logo data in the cartridge to the DMG rom values.To pass the check, I manually copied the logo data from somewhere to make
`DMG_ROM_WITH_SCREEN.bin` which passes the test. This does everything as before
like `DMG_ROM.bin`, but displays the nintendo screen, as it comes down in an
animation. Basically I stole enough of the nintendo logo from a different
cartridge and just embedded it into the rom image.To play with the boot instructions: run `DMG_ROM_WITH_SCREEN.bin`. Ignore the
pop-up. Then set PC to 0. Then `f9` to run. Or step.Ref:
http://bgb.bircd.org/pandocs.htm#powerupsequence
## Initial state
If you load tetris in no$gmb, you'll see that our emulator register state is
very close to the initial state (breakpoint at 0x101) in the emulator. That's
good news. The 'cheat' is to just set this initial register state and skip
doing the actual boot rom.Source:
http://marc.rawer.de/Gameboy/Docs/GBCPUman.pdf Page 17
http://bgb.bircd.org/pandocs.htm#powerupsequenceTest initial state with something like:
```
./driver.native --di --nr -v --file /tmp/q
` at every step
` use dummy file, not the default SCREEN one (execution corrupts starting values)
`-dump state```
## Debugging with tetris rom
Load the tetris rom and set BP at 0x100 and 0x101.
## Options
Run with the boot file (initial state is 0 for registers and memory):
`./driver.native --bootrom`
Set the speed per frame:
`./driver.native --speed 1.0`
Dump the state after each instruction
`./driver.native --di`
Run k steps
`./driver.native -k 0xbb34`
Hex dump and disassemble at offset
`./driver.native --hd --disas 0x100`