Dwi2fod Multishell "single-tissue" versus "multi-tissue" CSD

Yes, it is expected to be considerably different. I strongly advise against using single-tissue CSD with multi-shell data: your fODFs will actually degrade compared to single-tissue CSD with single-shell data and you will also not see any of the benefits of multi-compartment modelling as shown in Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion MRI data - PubMed.

If you have 16 threads at your disposal, the processing time of dwi2fod with three tissue types can’t be that long? Especially if you compare it to steps like motion and eddy current correction, dwi2fod running time should be minor (see https://www.sciencedirect.com/science/article/abs/pii/S1053811919307281, for typical runtimes). What is your spatial resolution?