summaryrefslogtreecommitdiff
path: root/mknes.c
Commit message (Collapse)AuthorAgeFilesLines
* workingHEADmasterPeter Fors2025-12-051-12/+21
|
* fix multiple mappersPeter Fors2025-11-021-15/+35
|
* some tas tests, desync is happeningPeter Fors2025-10-311-22/+65
|
* some optimization tests that didn't pan outPeter Fors2025-10-301-2/+2
|
* Finally back above 3000fps - this was a lot of workPeter Fors2025-10-291-4/+4
| | | | | | | | | | | | | | | | | | | | vital@claybabble:/work/current/mknes(master*|u=)$ ./Bench.sh Performance counter coverage: 100% (no multiplexing - full precision) 352509230343 instructions # 4.81 insn per cycle # 0.01 stalled cycles per insn 73277044838 cycles # 5.416 GHz 2957685039 stalled-cycles-frontend # 4.04% frontend cycles idle 70065301653 branches # 5.179 G/sec 297927451 branch-misses # 0.43% of all branches Throughput: 26056.16 MIPS, 5416.36 Mcycles/sec cycles/frame mean= 1788990 sd= 2803 relSD=0.157% n=10 insn/frame mean= 8606182 sd= 0 relSD=0.000% n=10 time (ms) mean= 1352.883 sd= 2.140 relSD=0.158% n=10 FPS (frames/second) = 3027.61 ms/frame = 0.330294
* reached 2944fps againPeter Fors2025-10-281-2/+2
|
* small optimizations of sprite evaluation in ppu_render_pixelPeter Fors2025-10-281-3/+3
|
* Changes to layoutPeter Fors2025-10-261-1/+0
|
* Refactor benchmarking to self-contained C implementationPeter Fors2025-10-251-29/+26
| | | | | | | | | | | | | - Add mknes_bench.c with direct PMC access via perf_event_open() - Remove dependency on external perf/awk for statistics - Add RT priority and CPU affinity control in C code - Use static BSS allocation (page-aligned) instead of malloc - Add stalled-cycles-backend counter (gracefully handles AMD unavailability) - Add throughput metrics (MIPS, Mcycles/sec) - Optimize Bench.sh to only regenerate profile data when needed - Add -n and -f flags for configurable runs and frames - Suppress mapper messages during benchmark - ~6x faster benchmark workflow (20s first run, 16s subsequent)
* Better benchmarking, some small optimizationsPeter Fors2025-10-251-8/+26
|
* optimized, but bug with rasterdemosPeter Fors2025-10-161-27/+28
|
* Rearrangement and refactoring and optimizations and more accuracyPeter Fors2025-10-091-147/+244
|
* testsPeter Fors2025-06-081-2/+14
|
* added scaffolding for apuPeter Fors2025-06-041-3/+3
|
* optimization of ppuPeter Fors2025-05-291-1/+1
|
* added mapper_tick() functionality, regressed 200fps, and optimized it back ↵Peter Fors2025-05-291-5/+0
| | | | up to ~1940fps
* renamingPeter Fors2025-05-271-50/+50
|
* 3011fpsPeter Fors2025-05-241-19/+42
|
* new and changed mappersPeter Fors2025-05-081-128/+71
|
* cleanup and rewrite of memory_read()Peter Fors2025-05-031-2/+2
|
* cleanup and rewrite of ppu_registers.cPeter Fors2025-05-021-5/+3
|
* reverted rewrite of ppu, optimized what functions should be forced inline, ↵Peter Fors2025-04-281-2/+2
| | | | gained ~2.5% performance
* before ppu rearrangement and fix of scanline 261Peter Fors2025-04-281-12/+14
|
* add -mtune=genericPeter Fors2025-04-211-6/+6
|
* back to 2560fps after wrangling the ppu_state into two cachelinesPeter Fors2025-04-211-4/+11
|
* Battletoads workingPeter Fors2025-04-161-17/+51
|
* Before partial rewrite of ppu_tick()Peter Fors2025-04-111-9/+8
|
* Last few optimizations to ppu_render_pixel, down to 1.59ns per pixelPeter Fors2025-04-091-1/+1
|
* 2690fpsPeter Fors2025-04-081-5/+5
|
* cleanup and optimization tests, unfortunately nothing great.Peter Fors2025-04-071-4/+7
|
* almost back to normal after bytestream excursionPeter Fors2025-04-071-3/+14
|
* cleanupPeter Fors2025-04-061-17/+17
|
* optimized away an if(dot == 256) in the hot path ~5% improvementPeter Fors2025-04-061-2/+2
|
* added more mappers (buggy)Peter Fors2025-04-061-67/+72
|
* everything working, 2285 fpsPeter Fors2025-04-051-34/+78
|
* reverted switch to switch, as it was slowerPeter Fors2025-04-051-3/+3
|
* transform to switch case for ppu_tick()Peter Fors2025-04-051-15/+21
|
* move read/write prg/chr/cirom data to mapperPeter Fors2025-04-041-22/+22
|
* Move to glfwPeter Fors2025-04-031-124/+166
|
* add mapper #66 (GXROM)Peter Fors2025-04-021-2/+4
|
* Itsa twerking.Peter Fors2025-04-021-2/+15
|
* skinny_version working horizontallyPeter Fors2025-04-021-5/+3
|
* semi working statePeter Fors2025-04-021-2/+2
|
* currently 90% workingPeter Fors2025-03-311-2/+61
|
* first mknes commitPeter Fors2025-03-291-0/+102