TechGoogle Implements Multi-Token Prediction to Triple Gemma 4 Inference SpeedThe company introduced 'drafter' models that use speculative decoding to accelerate the Gemma 4 family of open AI models by up to 3x.20 minutes ago