You should be able to get sub-microsecond latency on a single pin (I did). Maybe post your code, it looks like something else in your code may be...
Thanks for posting your updates... I am also looking at utilizing the GPIO via mmap... how did you find that GPIO40 is GPIO02_IO8, I was looking...
Separate names with a comma.