I have a heavy algorithm with multiple inputs and one output. I want to launch parallel Kotlin coroutines (one a thread pool) so that each one runs one instance of the algorithm