Dolly 2.0 is the first major open source language model with a free dataset for commercial use

American enterprise software company Databricks has released Dolly 2.0, the next version of its large language model (LLM), which follows instructions in a way similar to ChatGPT. According to the company, it is the first open source LLM tuned on a freely available instruction dataset that is licensed for commercial use, so companies can build their own AI projects without paying for API access or sharing data with third parties.

In recent months, a number of language models similar to OpenAI’s GPT have been released that could, by many definitions, be considered open. One of them is Meta’s LLaMA, which in turn inspired Alpaca, Koala, Vicuna and Dolly 1.0.

However, many of these “open” models came with restrictions on commercial use. Stanford’s Alpaca, for example, was built on LLaMA 7B and trained on instruction data generated with GPT-3.5. OpenAI’s terms of use prohibit using the output of its services to develop models that compete with the company.

Databricks aims to solve this problem. Dolly 2.0 is a large language model with 12 billion parameters, based on the open source Pythia model family from EleutherAI and fine-tuned exclusively on a small instruction dataset (databricks-dolly-15k) created by the Databricks team. The license terms of this dataset allow anyone to use, modify and extend it for any purpose, including academic or commercial applications.
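To make the idea of instruction tuning more concrete, here is a minimal sketch of how one instruction record might be turned into a single training string. The field names (`instruction`, `context`, `response`), the `format_record` helper, the section markers and the sample record are all assumptions made for illustration; this is not presented as the template Databricks actually used.

```python
def format_record(record: dict) -> str:
    """Join one instruction record into a single training string.

    Field names and section markers are illustrative assumptions,
    not a confirmed Databricks format.
    """
    parts = ["### Instruction:\n" + record["instruction"].strip()]

    # The context field is optional; skip the section when it is empty.
    context = record.get("context", "").strip()
    if context:
        parts.append("### Context:\n" + context)

    parts.append("### Response:\n" + record["response"].strip())
    return "\n\n".join(parts)


# A made-up record, used only to show the shape of the data.
example = {
    "instruction": "Summarize the text in one sentence.",
    "context": "Dolly 2.0 is a 12-billion-parameter language model.",
    "response": "Dolly 2.0 is a language model with 12 billion parameters.",
}

print(format_record(example))
```

A fine-tuning run would feed many such strings to the base model so that it learns to continue an instruction with a suitable response.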

The Databricks blog points out that, like the original Dolly, version 2.0 is not state-of-the-art, but that it follows instructions surprisingly well given the size of its training dataset. The post adds that the effort and expense required to create powerful artificial intelligence technologies is significantly less than previously thought.

The Dolly 2.0 model can be downloaded from the Databricks page on Hugging Face, and the instruction dataset is available on GitHub. The company is also inviting people to a webinar on April 25 that will explain how organizations can use the LLM.
