A Corpus Processing and Analysis Pipeline for Quickref

Main Authors: Antoine Hacquard, Didier Verna
Format: Proceeding Journal
Language: eng
Published: 2021
Subjects:
Online Access: https://zenodo.org/record/4714443
Table of Contents:
  • Quicklisp is a library manager that works with an existing Common Lisp implementation to download and install around 2000 libraries from a central archive. Quickref, an application itself written in Common Lisp, automatically generates technical documentation for every library in Quicklisp by introspection, and produces a website for this documentation. In this paper, we present a corpus processing and analysis pipeline for Quickref. This pipeline consists of a set of natural language processing blocks that allow us to analyze Quicklisp libraries based on natural language content sources such as README files, docstrings, or symbol names. The ultimate purpose of this pipeline is the generation of a keyword index for Quickref, although other applications such as word clouds or topic analysis are also envisioned.
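
To make the idea of a keyword index built from README text, docstrings, and symbol names more concrete, here is a minimal sketch in Python. It is not the authors' pipeline, which is written in Common Lisp and whose processing blocks are not detailed in this record. The toy corpus, the library names, the stopword list, and the choice of TF-IDF scoring are all assumptions made for illustration only.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy corpus: library name -> text gathered from its README,
# docstrings, and symbol names. These libraries do not exist in Quicklisp.
CORPUS = {
    "json-lib": "Parse JSON documents. parse-json encode-json json-stream",
    "http-lib": "HTTP client. http-request parse-url send-request",
    "png-lib": "Read PNG images. decode-png read-image image-width",
}

# Assumed minimal stopword list; a real pipeline would use a fuller one.
STOPWORDS = {"a", "an", "the", "and", "of", "to", "for", "in"}


def tokenize(text):
    """Lowercase and keep runs of letters, so `parse-json` yields `parse`, `json`."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def keyword_index(corpus, top_k=3):
    """Map each keyword to the libraries for which it ranks in the top_k by TF-IDF."""
    counts = {lib: Counter(tokenize(text)) for lib, text in corpus.items()}

    # Document frequency: in how many libraries does each term occur?
    doc_freq = Counter()
    for term_counts in counts.values():
        doc_freq.update(term_counts.keys())

    n_docs = len(corpus)
    index = defaultdict(list)
    for lib, term_counts in counts.items():
        total = sum(term_counts.values())
        scores = {
            term: (freq / total) * math.log(n_docs / doc_freq[term])
            for term, freq in term_counts.items()
        }
        # Highest score first; ties broken alphabetically for determinism.
        ranked = sorted(scores, key=lambda term: (-scores[term], term))
        for term in ranked[:top_k]:
            index[term].append(lib)
    return dict(index)


if __name__ == "__main__":
    for keyword, libs in sorted(keyword_index(CORPUS).items()):
        print(keyword, "->", ", ".join(libs))
```

Splitting symbol names on hyphens lets identifiers contribute the same vocabulary as prose, and the inverse document frequency term pushes down words such as `parse` that occur in several libraries, so they are less likely to be selected as distinguishing keywords.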