Grok-2 gets a speed bump after developers rewrite code

August 24, 2024

Elon Musk’s xAI has made waves in the last week with the release of its Grok-2 large language model (LLM) chatbot — available through an $8 USD monthly subscription on the social network X.

Now both versions, Grok-2 and Grok-2 mini (the latter designed to be less powerful but faster), analyze information and output responses more quickly, after two developers at xAI completely rewrote the inference code stack in the last three days.

As xAI developer Igor Babuschkin posted this afternoon on the social network X under his handle @ibab:

“Grok 2 mini is now 2x faster than it was yesterday. In the last three days @lm_zheng and @MalekiSaeed rewrote our inference stack from scratch using SGLang. This has also allowed us to serve the big Grok 2 model, which requires multi-host inference, at a reasonable speed. Both models didn’t just get faster, but also slightly more accurate. Stay tuned for further speed improvements!”
