Well I don't know about phone chips but using the metal api the cores in the m1 and up chips are fast enough to run 7B models nearly as fast as gpt4 runs.
Well I don't know about phone chips but using the metal api the cores in the m1 and up chips are fast enough to run 7B models nearly as fast as gpt4 runs.