Underwhelming DMA¶
Published on 2019-04-19 in µGame.
I finally worked on an improvement that I long wanted to do to the _stage library, which is at the heart of µGame: make the communication with the display asynchronous by using DMA. This way I can have two buffers, and calculate the contents of one of them while the other one is being sent to the display.
As a test I used the balls demo from the tutorial, but with all waiting for the next frame removed — so they move as fast as they can.
Here is how it works before DMA:
data:image/s3,"s3://crabby-images/52b4d/52b4dee3dd9f4e601d65e12d87803a35561f4cf7" alt="../../../_images/4838621555700681233.gif"
And here is how it works with DMA:
data:image/s3,"s3://crabby-images/8ef59/8ef59b116931cf6e8fa628ff2e3b9592722a778b" alt="../../../_images/1409261555700732524.gif"
As you can see, there is barely any perceptible difference at all!
The code I wrote for this is pretty nasty and hacky, and after seeing this, I don’t think I have the heart to clean it up and send it as a pull request — it’s simply not worth it.
However, I will keep an eye for other ways of improving the firmware.
UPDATE: Yes, I made some more precise measurements as well. Running through 1000 frames of the animation took the DMA version 31.034s and the non-DMA version 32.699s. That is 32.22 fps with DMA and 30.58 fps without.
UPDATE2: I did a second test, with full-screen updates this time. 159.08s for non-DMA, versus 148.708s for DMA. That’s 6.28 fps versus 6.72 fps. Even with such a large amount of data, the difference is negligible compared to the time spent computing it.