@bOtM to LocalLlama • 14 days ago
2.2x faster (tokens/sec) than an RTX 4090 24 GB running Llama 3.1 70B Q4!