Pretty nice +1. To get things really fast you could try CRAYing in and out INST directly instead of PCSN, so the connection from input to output is only INST without any PSCN or NSCN. This might give you some hints id:2946121 (the second design from the top).