forked from sokirko74/aot

Seman is a set of linguistic tools for analyzing Russian or German texts; it contains lexicons and grammars. The project is useful as a baseline for many research projects in computational linguistics.

faustoFF/aot

 
 

====================== Introduction ====================

This is the RML package (RML is a Russian acronym that stands for Linguistic Environment).

This program is distributed under the GNU Library General Public License; the full text
is in the file COPYING.

This program was written by Alexey Sokirko, Igor Nozhov, Lev Gershenzon, Andrey Putrin
and many other people.

The project started in Moscow at the Dialing company (Russian and English languages).
The German part was created at the Berlin-Brandenburg Academy of Sciences and
Humanities in Berlin (the DWDS project).

The Russian website (with documents in Russian) is www.aot.ru.
More details about the morphology dictionaries are in Docs/Morph_UNIX.txt

==============  Prerequisites    ========================
0. Install cmake (version 3.5 or higher) from https://cmake.org/download/ and gzip (cygwin).

1. Install g++ 9.0 or higher, or Microsoft Visual Studio 2019.

2. You need flex and bison to build the projects that come after Morphology (Synan and Seman):
    sudo apt install flex bison

2.1 On Windows I used flex and bison from https://sourceforge.net/projects/winflexbison/?source=typ_redirect
     mkdir RML/external
     mkdir RML/external/winflex
     cd RML/external/winflex
     wget -O win_flex_bison-2.5.24.zip https://sourceforge.net/projects/winflexbison/files/win_flex_bison-2.5.24.zip/download
     unzip win_flex_bison-2.5.24.zip
    

3. To build lemmatizer_python (switched off by default) you need swig and python-dev.
  3.1 On Windows I used
    https://sourceforge.net/projects/swig/files/swigwin/swigwin-3.0.12/swigwin-3.0.12.zip/download
    https://www.python.org/downloads/release/python-2715/
  3.2 To compile lemmatizer_python you also need to
    3.2.1 install swig (swig.org) and adjust the path in lemmatizer_python/CMakeLists.txt
    3.2.2 install python3 with debug binaries (see https://stackoverflow.com/questions/35250175/i-cannot-find-python35-d-lib)

 
5. To compile the HTTP daemons, install libevent (https://github.com/libevent/libevent):
   5.1  download the source archive from https://libevent.org/, for example:
   5.2  wget https://github.com/libevent/libevent/releases/download/release-2.1.12-stable/c
   5.3  tar xfz libevent-2.1.12-stable.tar.gz
   5.4  mv libevent-2.1.12-stable/libevent-2.1.12-stable $RML/Source/contrib/libevent
   5.5  cd $RML/Source/contrib/libevent
   5.6  ./configure --disable-openssl
   5.7  make && sudo make install

   or install the packaged version:
   sudo apt install libevent-dev
  
6. To compile the COM objects you need to run Visual Studio as administrator.

7. To run the Windows GUI programs Rossdev.exe and MorphWizard you must set Russian everywhere in the Region settings.
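
The Linux prerequisites above (cmake 3.5+, g++ 9+, flex, bison) can be checked before starting a build. The following is only a sketch and not part of RML: the helper names version_ge, report_tool, cmake_version and gxx_version are made up for illustration, and the version comparison assumes a `sort` that supports `-V` (GNU coreutils).

```shell
#!/bin/sh
# Sketch: report which build prerequisites are present on this machine.
# All function names here are illustrative assumptions, not RML tooling.

# version_ge A B: succeeds when version A >= version B (needs `sort -V`).
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Print the installed version of each versioned tool.
cmake_version() { cmake --version | head -n 1 | awk '{print $3}'; }
gxx_version() { g++ -dumpversion; }

# report_tool NAME MIN [VERSION_FUNCTION]: print ok / missing / too old.
# Pass an empty MIN to check only that the tool exists.
report_tool() {
    name=$1; min=$2; shift 2
    if ! command -v "$name" >/dev/null 2>&1; then
        echo "$name: missing"
        return 1
    fi
    if [ -n "$min" ]; then
        found=$("$@" 2>/dev/null)
        if ! version_ge "$found" "$min"; then
            echo "$name: found $found, need at least $min"
            return 1
        fi
    fi
    echo "$name: ok"
}

missing=0
report_tool cmake 3.5 cmake_version || missing=1
report_tool g++ 9 gxx_version || missing=1
report_tool flex "" || missing=1
report_tool bison "" || missing=1

if [ "$missing" -eq 0 ]; then
    echo "All checked prerequisites were found."
else
    echo "Some prerequisites are missing or too old; see the list above."
fi
```

The script only reports and never installs anything, so it is safe to run repeatedly. The g++ minimum is written as 9 rather than 9.0 because `g++ -dumpversion` may print only the major version.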


=================   Compilation =======================
  export RML=/home/sokirko/RML
  export CXX=/usr/bin/g++
  export CC=/usr/bin/gcc
  cd $RML
  mkdir build
  cd build
  cmake ..
   # or, for a debug build:
   # cmake .. -DCMAKE_BUILD_TYPE=Debug
  make
  make DESTDIR=$RML install
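
The compilation steps above could be wrapped in a small script that stops at the first failing step. This is a sketch under assumptions: build_rml, cmake_type_flag and the BUILD_TYPE environment variable are hypothetical names introduced here, and the script does nothing except print a hint unless RML points at an existing directory.

```shell
#!/bin/sh
# Sketch of a wrapper around the compilation steps; not part of RML.
# build_rml, cmake_type_flag and BUILD_TYPE are illustrative names.

# cmake_type_flag TYPE: print the cmake option for a build type,
# or nothing when TYPE is empty (cmake then uses its default).
cmake_type_flag() {
    if [ -n "$1" ]; then
        echo "-DCMAKE_BUILD_TYPE=$1"
    fi
}

# build_rml [TYPE]: configure, build and install inside $RML/build.
# The body runs in a subshell so the caller's working directory is kept,
# and the && chain stops at the first step that fails.
build_rml() (
    mkdir -p "$RML/build" &&
    cd "$RML/build" &&
    cmake .. $(cmake_type_flag "${1:-}") &&
    make &&
    make DESTDIR="$RML" install
)

# Build only when RML names an existing source directory.
if [ -d "${RML:-}" ]; then
    build_rml "${BUILD_TYPE:-}"
else
    echo "Set RML to the source directory first, e.g. export RML=/home/sokirko/RML"
fi
```

For example, saving this under a name of your choice and running it with `BUILD_TYPE=Debug` set in the environment would request the debug configuration mentioned above; leaving BUILD_TYPE unset runs plain `cmake ..`.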
