R package for identifying text containing natural language (or not) using machine learning.
With devtools:
devtools::install_github("M3SOulu/NLoN")
library(NLoN)
model <- NLoNModel(nlon.data$text, nlon.data$rater2)
topredict <- c("This is natural language.", "public void NotNaturalLanguageFunction(int i, String s)")
NLoNPredict(model, topredict)
For more information on how to use this package, check out its vignette on GitHub or directly from R:
library(NLoN)
vignette("NLoN")
If you use our tool please cite our paper: Mäntylä M. V., Calefato F., Claes M, "Natural Language or Not (NLoN) - A Package for Software Engineering Text Analysis Pipeline", The 15th International Conference on Mining Software Repositories (MSR 2018), May 28--29, 2017, Gothenburg, Sweden, pp. 1-5 https://mmantyla.github.io//2018_Mantyla_MSR_natural-language-nlon.pdf