(This post is part of a series on the subject of my hobby project, which is recreating the C source code for the 1989 game F-15 Strike Eagle II by reverse engineering the original binaries.)
This is just a short update to share a significant milestone: the first reconstructed executable (START.EXE) is now playable in the original game.
I “finished” the reconstruction a few months ago, meaning all of the code that was generated from C source has been transcribed back into identical C. I left the assembly routines as is (meaning as generated from the IDA liisting), except for some variables having meaningful names, and some comments, both carried over from research done in IDA. Also, the contents of the data segment have been (and still are) generated from assembly. I am not sure how the reconstruction will behave after these are moved back to C, but probably there will be some fallout in the form of bugs to fix. I have strived to replace all hard-baked offsets with references to symbols, but still something might have slipped through the cracks.
In any case, I was pretty surprised that the reconstruction did not run given that the code was “identical”, as attested by my mzdiff tool. But the thing is, the tool cannot tell for instructions like mov ax, 0x1234 if the immediate value is some computational constant, or an offset of a variable, so it does not consider immediate value differences as straight-up mismatches, period. This has the potential to backfire badly, and after I had fixed the big problem with the incorrect value being set for the data segment in the previous post, most problems turned out to have been caused by typos where I put an incorrect immediate value, and mzdiff ignored the difference. These were usually small numbers, so I actually put in a silly heuristic into the latest mzdiff to highlight instructions differing on immediate values in bright red as a warning if the value of the immediate is less than 0xff. It actually came pretty handy and I was able to find a bunch more that needed resolving.