I have exactly the same problem with mcc (a slowdown of roughly 4x). I ended up rewriting the functions in C and compiling them as MEX files, which gave a 50x speed-up. I'd really appreciate anyone's input on this.
Thanks,
Petr