GitHub Java Corpus - Function Identifiers
Main Author: | Weyssow, Martin |
---|---|
Format: | info dataset Journal |
Published: | 2020 |
Subjects: | |
Online Access: | https://zenodo.org/record/4266099 |
Table of Contents:
- This dataset contains function identifiers extracted from the GitHub Java Corpus (http://groups.inf.ed.ac.uk/cup/javaGithub/). Each line corresponds to one method declaration: the name of the method, followed by the function identifiers (i.e., function calls) that appear within the method body. The file embeddings_train.json can be used to train a word or sentence embedding model using the code in the GitHub repository linked below. The corpus was used for the experiments in the paper "Combining Code Embedding with Static Analysis for Function-Call Completion". GitHub repository for replicating the experiments: https://github.com/mweyssow/cse-saner
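
The line layout described above (a method name followed by the function calls in its body) could be read along the following lines. This is only a minimal sketch: the record does not state the delimiter, so whitespace-separated tokens are an assumption, and `MethodRecord`, `parse_line`, `parse_lines`, and the sample lines are hypothetical names and data made up for illustration, not part of the dataset or the cse-saner repository.

```python
from typing import Iterable, List, NamedTuple, Optional


class MethodRecord(NamedTuple):
    """One method declaration: its name and the calls found in its body."""
    name: str
    calls: List[str]


def parse_line(line: str) -> Optional[MethodRecord]:
    # ASSUMPTION: tokens are separated by whitespace; the real files may
    # use a different delimiter, so check the data before relying on this.
    tokens = line.split()
    if not tokens:
        return None  # skip blank lines
    # The first token is the declared method name, the rest are its calls.
    return MethodRecord(name=tokens[0], calls=tokens[1:])


def parse_lines(lines: Iterable[str]) -> List[MethodRecord]:
    # Keep only the lines that actually contain a method declaration.
    records = []
    for line in lines:
        record = parse_line(line)
        if record is not None:
            records.append(record)
    return records


# Hypothetical sample lines, invented for illustration only.
sample = ["readFile open read close", "", "isEmpty size"]
records = parse_lines(sample)
for record in records:
    print(record.name, record.calls)
```

Keeping the method name separate from its calls makes it straightforward to treat each record as a short "sentence" of identifiers when preparing input for an embedding model.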