forked from sokirko74/aot
Seman is a set of linguistic tools for analyzing Russian or German texts; it contains lexicons and grammars. The project is useful as a baseline for many research projects in computational linguistics.
faustoFF/aot
====================== Introduction ====================

This is the RML package (RML is a Russian acronym that stands for "Linguistic Environment").
This program is distributed under the Library GNU Public Licence, which is in the file COPYING.

This program was written by Alexey Sokirko, Igor Nozhov, Lev Gershenzon, Andrey Putrin and many other people.
The project started in Moscow at the Dialing company (Russian and English languages).
The German part was created at the Berlin-Brandenburg Academy of Sciences and Humanities in Berlin (the DWDS project).

The Russian website (with documents in Russian) is www.aot.ru.

More details about the morphology dictionaries are in Docs/Morph_UNIX.txt.

============== Prerequisites ========================

0. Install cmake (at least 3.5) from https://cmake.org/download/ and gzip (cygwin).

1. g++ 9.0 or higher, or Microsoft Visual Studio 2019.

2. You need flex and bison to build the projects that come after Morphology (Synan and Seman):
     sudo apt install flex bison
2.1 For Windows I used flex and bison from https://sourceforge.net/projects/winflexbison/?source=typ_redirect
     mkdir RML/external
     mkdir RML/external/winflex
     cd RML/external/winflex
     wget https://sourceforge.net/projects/winflexbison/files/win_flex_bison-2.5.24.zip/download
     unzip win_flex_bison-2.5.24.zip

3. To build lemmatizer_python (switched off by default) you need swig and python-dev.
3.1 For Windows I used
     https://sourceforge.net/projects/swig/files/swigwin/swigwin-3.0.12/swigwin-3.0.12.zip/download
     https://www.python.org/downloads/release/python-2715/
3.2 To compile lemmatizer_python you also need to:
3.2.1 install swig (swig.org) and adjust the path in lemmatizer_python/CMakeLists.txt
3.2.2 install python3 (with debug binaries, see https://stackoverflow.com/questions/35250175/i-cannot-find-python35-d-lib)

5. To compile the HTTP daemons, install libevent (https://github.com/libevent/libevent):
5.1 download the source archive from https://libevent.org/ , for example
5.2 wget https://github.com/libevent/libevent/releases/download/release-2.1.12-stable/c
5.3 tar xfz libevent-2.1.12-stable.tar.gz
5.4 mv libevent-2.1.12-stable/libevent-2.1.12-stable $RML/Source/contrib/libevent
5.5 cd $RML/Source/contrib/libevent
5.6 ./configure --disable-openssl
5.7 make && sudo make install
   or
     sudo apt install libevent-dev

6. To compile the COM objects you need to run Visual Studio as administrator.

7. To run the Windows GUI programs Rossdev.exe and MorphWizard you must set Russian everywhere in the Region settings.

================= Compilation =======================

  export RML=/home/sokirko/RML
  export CXX=/usr/bin/g++
  export CC=/usr/bin/gcc
  cd $RML
  mkdir build
  cd build
  cmake ..
  # or, for a debug build:
  # cmake .. -DCMAKE_BUILD_TYPE=Debug
  make
  make DESTDIR=$RML install
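
Before running the compilation steps above on Linux, it can help to confirm that the listed prerequisites are present. The following is only a minimal sketch and is not part of the RML package: the helper names version_ge and check_tool are hypothetical, and the script assumes a POSIX shell with awk available. It looks for cmake, g++, flex and bison on the PATH and compares the cmake and g++ versions against the minimums stated in the Prerequisites section (3.5 and 9.0).

```shell
#!/bin/sh
# Sketch of a prerequisite check. version_ge and check_tool are
# hypothetical helper names, not something shipped with RML.

# Succeed when dotted version $1 is greater than or equal to $2.
# Missing components count as 0, so "9" compares equal to "9.0".
version_ge() {
    awk -v a="$1" -v b="$2" 'BEGIN {
        n = split(a, x, "."); m = split(b, y, ".")
        max = (n > m) ? n : m
        for (i = 1; i <= max; i++) {
            xi = (i <= n) ? x[i] + 0 : 0
            yi = (i <= m) ? y[i] + 0 : 0
            if (xi > yi) exit 0
            if (xi < yi) exit 1
        }
        exit 0
    }'
}

# Report whether a tool is on the PATH; return non-zero when it is not.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found:   $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Tools named in the Prerequisites section.
for tool in cmake g++ flex bison; do
    check_tool "$tool" || true
done

# cmake prints "cmake version X.Y.Z" on its first line.
if command -v cmake >/dev/null 2>&1; then
    cmake_version=$(cmake --version | awk 'NR == 1 { print $3 }')
    if version_ge "$cmake_version" 3.5; then
        echo "cmake $cmake_version satisfies the 3.5 minimum"
    else
        echo "cmake $cmake_version is older than 3.5"
    fi
fi

# g++ -dumpversion prints either a major number or a full dotted version.
if command -v g++ >/dev/null 2>&1; then
    gxx_version=$(g++ -dumpversion)
    if version_ge "$gxx_version" 9.0; then
        echo "g++ $gxx_version satisfies the 9.0 minimum"
    else
        echo "g++ $gxx_version is older than 9.0"
    fi
fi
```

The script only reports what it finds and never aborts, so it can be run safely before installing anything. It does not cover the optional components (swig, python, libevent) or the Windows toolchain.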