GPT-SW3

submited by
Style Pass
2023-01-24 20:30:07

GPT-SW3 is the first truly large-scale generative language model for the Swedish language. Based on the same technical principles as the much-discussed GPT-3, GPT-SW3 will help Swedish organizations build language applications never before possible.

The pre-release is an important step in the process of knowledge building and validating the model (with 126M, 356M, 1.3B, 6.7B, 20B) and collecting feedback on both what works well and what does not.

The models are accessible in a private repository under a modified RAIL license on Hugging Face, where we also provide both a model card and a datasheet - please note that you will need significant computation power. In order to access the repository and use the model you need to apply using this form. All applicants will have to approve the license and go through manual approval before the model is provided. The pre-release is intended for organizations and individuals in the Nordic NLP ecosystem.

Join the conversation on Discord and reach out to Francisca Hoyer to share your learnings! And keep an eye out for workshops and seminars for deep dives into use cases and collective problem solving.

Leave a Comment