AI & RoboticsNews

Facebook open-sources data set for code search AI benchmark

Facebook AI researchers created code search data sets that utilize information from GitHub and Stack Overflow. The release contains an evaluation data set of 287 Stack Overflow question-and-answer pairs including code snippets, as well as a search corpus of code snippets from nearly 25,000 Android repositories on GitHub.

The Neural Code Search Evaluation Data Set was published on arXiv in August and revised Wednesday. The Stack Overflow data comes from the Stack Overflow Data Dump, while the GitHub Rest API supplied the rest of the data.

“We intend for this data set to serve as a benchmark for evaluating search quality across a variety of code search models,” Facebook AI said in a blog post.

The paper also shares results of two AI models created by Facebook as a test run of the corpus and data set.

Code search is meant to give developers a way to surface chunks of programming language code using natural language. A number of code search initiatives are underway such as GitHub’s Semantic Code Project and machine learning initiative and startups like recent Y Combinator graduate Metacode.

In other developments in AI for software developers, this spring Google Brain introduced AI that predicts code based on previous edits.


Author: Khari Johnson
Source: Venturebeat

Related posts
GamingNews

Nintendo Has Replaced Samus' Voice Actor For Metroid Prime 4, So It's No Longer Mass Effect's Jennifer Hale Doing the Grunts

GamingNews

The Physics Inside a Black Hole Are Still a Mystery in the 41st Millennium, According to a New Warhammer 40,000 Novel — Even to the Necrons

GamingNews

Mario Kart World Update 1.4.0 Tweaks Track Layouts, Adds Custom Item Rules, Now Lets You See What Music is Playing

CryptoNews

Vanguard’s Massive Crypto Reversal Triggers ‘Highly Bullish’ Mainstream Momentum

Sign up for our Newsletter and
stay informed!