Products

Taalas HC1 Technology Demonstrator

Runs Llama 3.1 8B model
TSMC 6nm | 815mm2 | 53B Transistor
2.5 kW Server

Instantaneous Inference

HC1 demonstrates the power of Taalas hardcore model silicon technology, delivering 17k tokens per second per user on Llama 3.1 8B model.

Chart showing speed comparison between Taalas and competitors - tokens per second per user

Source: Model Llama 3.1 8B, Nvidia Baseline (H200), B200 measured by Taalas | Groq, Sambanova, Cerebras performance from Artificial Analysis | Taalas Performance run by Taalas labs | Input sequence length 1k/1k

Join our team!

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nullam placerat iaculis porta. Nam id blandit lectus. Vivamus at turpis eu dolor vulputate dignissim.

Send your CV

[contact-form-7 id="c1a6c82" title="Contact form"]

By submitting this form: You agree to the processing of the submitted personal data in accordance with our Privacy Policy, including the transfer of data to the United States.

This website uses cookies to improve user experience. To learn more take a look at our Privacy policy.

By selecting "Accept cookies" on this banner, you agree to the use and storage of cookies on your device.