IBM unveils AI programming training dataset

IBM expects to create a kind of analogue of ImageNet for intelligent development tools, which has actually become the standard set of images for training AI models. At the THINK conference, the company announced that it has collected a huge array of source codes for this.

The set, called Project CodeNet, contains 14 million samples with a total volume of 500 million lines of code in more than 55 programming languages: from Java, C and Go to COBOL, Pascal and FORTRAN. However, more than three quarters of all code is in C ++ and Python.

The source of the code was two Japanese programming contests: Aizu and AtCoder. According to the terms of the contests, participants had to write the code necessary to turn a given set of inputs into a set of desired outputs for 4000 different problems. Thus, 14 million code samples were obtained, about half of which turned out to be working, and the rest were marked as uncompiled, incorrect or containing errors.

IBM хочет, чтобы проект CodeNet пошёл по стопам ImageNet и стал де-факто стандартным набором данных для обучения ИИ-моделей, способных распознавать структуру программ. Предполагается, что CodeNet можно будет использовать для создания интеллектуальных инструментов разработки, осуществляющих поиск нужных процедур в приложениях и библиотеках, перевод с одного языка программирования на другой, выбор правильных реализаций и отсев ошибочных, классификацию кода и так далее.


If you notice an error, select it with the mouse and press CTRL + ENTER. | Can you write better? We are always glad to new authors.

A source:

Related Posts

Property Management in Dubai: Effective Rental Strategies and Choosing a Management Company

“Property Management in Dubai: Effective Rental Strategies and Choosing a Management Company” In Dubai, one of the most dynamically developing regions in the world, the real estate…

In Poland, an 18-year-old Ukrainian ran away from the police and died in an accident, – media

The guy crashed into a roadside pole at high speed. In Poland, an 18-year-old Ukrainian ran away from the police and died in an accident / illustrative…

NATO saw no signs that the Russian Federation was planning an attack on one of the Alliance countries

Bauer recalled that according to Article 3 of the NATO treaty, every country must be able to defend itself. Rob Bauer commented on concerns that Russia is…

The Russian Federation has modernized the Kh-101 missile, doubling its warhead, analysts

The installation of an additional warhead in addition to the conventional high-explosive fragmentation one occurred due to a reduction in the size of the fuel tank. The…

Four people killed by storm in European holiday destinations

The deaths come amid warnings of high winds and rain thanks to Storm Nelson. Rescuers discovered bodies in two separate incidents / photo ua.depositphotos.com Four people, including…

Egg baba: a centuries-old recipe of 24 yolks for Catholic Easter

They like to put it in the Easter basket in Poland. However, many countries have their own variations of “bab”. The woman’s original recipe is associated with…

Leave a Reply

Your email address will not be published. Required fields are marked *