Why are they only testing inference vs training?
Not many companies are going to want to deploy their own public-facing chatbot service. But almost everyone in this space is going to want to train their models, which is where the performance boost comes in.