▲ | lllllm 4 days ago | ||||||||||||||||||||||
benchmarks: we provide plenty in the over 100 page tech report here https://github.com/swiss-ai/apertus-tech-report/blob/main/Ap... quantizations: available now in MLX https://github.com/ml-explore/mlx-lm (gguf coming soon, not trivial due to new architecture) model sizes: still many good dense models today lie in the range between our small and large chosen sizes | |||||||||||||||||||||||
▲ | dcreater 4 days ago | parent [-] | ||||||||||||||||||||||
Thank you! Why are the comparisons to llama3.1 era models? | |||||||||||||||||||||||
|