Thank you so much, you helped me narrow down the issue. I had suspected memory, parallelization, ...
I solved it! You are right, it is in the subroutine.
I found a mistake in the interface, which I had copied from the Abaqus documentation:
I forgot to replace strain with stress: field(nblock, nfieldv)...