denysvitali 7 days ago
Report: https://github.com/swiss-ai/apertus-tech-report/raw/refs/hea...

Key features

Fully open model: open weights + open data + full training details, including all data and training recipes

Massively multilingual: 1811 natively supported languages

Compliant: Apertus is trained while respecting opt-out consent of data owners (even retrospectively) and avoiding memorization of training data
lyu07282 6 days ago
Their struggle with the Nvidia driver bugs they had to work around was very relatable. You'd think that if someone buys 10,752 of their high-end GPUs, they'd get some support with it.
Bromeo 7 days ago
Looks like the performance is pretty decent, somewhere around Llama3.1 for general knowledge (Table 17) but still a bit behind in Code and Reasoning (Table 18). Llama3.1 was released about one year ago.
esafak 4 days ago
There's an interesting "Swiss AI Charter" on pg. 107.