video-background

Products

Taalas HC1 Technology Demonstrator

  • Runs Llama 3.1 8B model
  • TSMC 6nm | 815mm2 | 53B Transistor
  • 2.5 kW Server
Taalas HC1 board

Instantaneous Inference

HC1 demonstrates the power of Taalas hardcore model silicon technology, delivering over 17k tokens per second per user on Llama 3.1 8B model.

Chart showing speed comparison between Taalas and competitors - tokens per second per user

Source: Model Llama 3.1 8B, Nvidia Baseline (H200), B200 measured by Taalas | Groq, Sambanova, Cerebras performance from Artificial Analysis | Taalas Performance run by Taalas labs | Input sequence length 1k/1k

Close

Join our team!

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nullam placerat iaculis porta. Nam id blandit lectus. Vivamus at turpis eu dolor vulputate dignissim.

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nullam placerat iaculis porta. Nam id blandit lectus. Vivamus at turpis eu dolor vulputate dignissim.

Send your CV

[contact-form-7 id="c1a6c82" title="Contact form"]

By submitting this form: You agree to the processing of the submitted personal data in accordance with our Privacy Policy, including the transfer of data to the United States.

Search

You are using an outdated browser which can not show modern web content.

We suggest you download Chrome or Firefox.