This was a very interesting read. I tried very basic translation myself with chat gpt before although I was weary of it's reliably. This however makes me more hopeful.
I'm wondering about the randomness issue, are you only using the web interface and not the API? Couldn't you set the temp=0 to get the output to be more reliable? (I'm assuming you most likely don't have chat gpt 4 API access so I know you would have to wait for that.)