Skip to content
@LanguageMachines

Language Machines

NLP Research group at Centre for Language Studies, Radboud University Nijmegen

Popular repositories Loading

  1. frog frog Public

    Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

    C++ 75 11

  2. ucto ucto Public

    Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use…

    C++ 65 13

  3. PICCL PICCL Public

    A set of workflows for corpus building through OCR, post-correction and normalisation

    Python 48 6

  4. timbl timbl Public

    TiMBL implements several memory-based learning algorithms.

    C++ 46 9

  5. LuigiNLP LuigiNLP Public

    A workflow system for Natural Language Processing.

    Python 21 4

  6. libfolia libfolia Public

    FoLiA library for C++

    C++ 16 7

Repositories

Showing 10 of 54 repositories
  • libfolia Public

    FoLiA library for C++

    LanguageMachines/libfolia’s past year of commit activity
    C++ 16 GPL-3.0 7 5 0 Updated Nov 21, 2024
  • foliatest Public

    Test suite for libfolia

    LanguageMachines/foliatest’s past year of commit activity
    C++ 0 GPL-3.0 1 0 0 Updated Nov 20, 2024
  • foliautils Public

    Command-line utilities for working with the Format for Linguistic Annotation (FoLiA), powered by libfolia (C++), written by Ko van der Sloot (CLST, Radboud University)

    LanguageMachines/foliautils’s past year of commit activity
    C++ 4 GPL-3.0 3 8 0 Updated Nov 19, 2024
  • ticcutils Public

    Ticcutils, a generic utility library shared by our software.

    LanguageMachines/ticcutils’s past year of commit activity
    C++ 7 GPL-3.0 8 2 0 Updated Nov 19, 2024
  • dimbl Public

    Distributed Tilburg Memory Based Learner

    LanguageMachines/dimbl’s past year of commit activity
    C++ 2 GPL-3.0 2 0 0 Updated Nov 18, 2024
  • wopr Public

    Memory Based Word Predictor/Language Model http://ilk.uvt.nl/wopr/

    LanguageMachines/wopr’s past year of commit activity
    C++ 5 0 1 0 Updated Nov 18, 2024
  • ticcltools Public

    Tools for TICCL

    LanguageMachines/ticcltools’s past year of commit activity
    C++ 14 GPL-3.0 3 17 0 Updated Nov 18, 2024
  • frogtests Public

    Unit tests for Frog

    LanguageMachines/frogtests’s past year of commit activity
    Lex 0 0 1 0 Updated Nov 18, 2024
  • uctodata Public

    Datafiles for the tokenizer ucto.

    LanguageMachines/uctodata’s past year of commit activity
    Shell 9 GPL-3.0 5 3 0 Updated Nov 18, 2024
  • frogdata Public

    Data for Frog, mandatory

    LanguageMachines/frogdata’s past year of commit activity
    Lex 1 GPL-3.0 5 1 1 Updated Nov 18, 2024

Top languages

Loading…

Most used topics

Loading…