pynzbdex

Service Dependencies

Riak: document storage db for handling NZBs, header info.
Redis: highspeed messaging passing/queing system for notifying population script
MySQL: stores article relational data (Files, Reports, etc. )
Binary Library Dependencies
Protobuf: riak client requires protobuf is installed
Arch: protobuf-2.4.1-2 or newer (community/protobuf)
Python Library Dependencies
For Arch, better to install dist for these:
- extra/python2-lxml
Until SQL alchemy bumps their official version to 0.8: http://pypi.python.org/pypi/SQLAlchemy/0.8.0b2
See requirements.txt
Config Dependencies:
riakkit: search must be enabled on bitcask, indexes must be installed for each bucket: search-cmd install group search-cmd install article
beam modules: beam modules must be compiled in the regular way and installed for Riak https://github.com/basho/riak_kv/blob/master/src/riak_kv_mapreduce.erl https://github.com/basho/riak_function_contrib/blob/master/mapreduce/erlang/delete_keys.erl
mysql: expects all schemas to be UTF8
redis: runs in cache mode (implicit TTL) #maxmemory 32mb #maxmemory-policy allkeys-lru maxmemory 256mb maxmemor-policy never-expire

Service Architecture

[usenet provider] ---> [pynntpprox] ---> [pynzbdex/cmddex] ---> [riak]   (full document)
                                                           ---> [redis]  (subset of document:  mesg_id:{ from, subject, date, newsgroups })

        ...

[redis] ---> [pynzbdex/cmdpop] ---> [mysql] (decompose redis msg [pickle string] into standard relational model, [or mark as deleted])

Assembly Strategies and Notes

pynzbdex necessarily groups articles together that comprise uploaded files. it follow's Perl's aub's strategies for indirectly identifying alike files.

Skipped if:

null subject or subject begins with Re:
any subject with .htm or more than one !(exclaimation point)

Included if:

IFF subject matches: ( $subject =~ m/^(.)^\d/(1)[^\d]/i ) || ( $subject =~ m/^(.\D)(\d+)\s*/\s*(\d+)/ ) || ( $subject =~ m/^(.\D)(\d+)\s|\s*(\d+)/ ) || ( $subject =~ m/^(.\D)(\d+)\s\\s*(\d+)/ ) || ( $subject =~ m/^(.\D)(\d+)\so\sf\s(\d+)/i ) || ( $subject =~ m/^(.\D)(\d+)\sf\so\s(\d+)/i )
subject includes filename extensions: these "hints" are used to help identify, not required like the previous item. (".gif", ".jpg", ".jpeg", ".gl", ".zip", ".au", ".zoo", ".exe", ".dl", ".snd", ".mpg", ".mpeg", ".tiff", ".lzh", ".wav" )

#CREATE DATABASE pynzbdex /*!40100 DEFAULT CHARACTER SET utf8 COLLATE utf8_unicode_ci */

#subj_key varchar(767) CHARACTER SET latin1 DEFAULT NULL AFTER from_

UNIQUE KEY `subj_key` (`subj_key`),

from pynzbdex import storage from sqlalchemy.schema import CreateTable print CreateTable(storage.sql.File.table)

-> need new scanner that just sees if there is a riak article backing the sql article.. simple loop over all articles...

-> factor out article deletion cascade/count

-> consider refactoring aggregator to make each scanner/processor class-based

-> fix "index completion" graphic -> check math, etc.. -> files -> # of file records wrt # of articles grouped by association -> reports -> same but for report -> file

/etc/profile

addl custom

PATH="$PATH:/opt/riak/bin" export PATH

search-cmd install article search-cmd install group

search-cmd set-schema article schema/article.erl search-cmd set-schema group schema/group.erl

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
pynzbdex		pynzbdex
riak		riak
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pynzbdex

Service Dependencies

Service Architecture

Assembly Strategies and Notes

UNIQUE KEY `subj_key` (`subj_key`),

addl custom

About

Releases

Packages

Contributors 2

Languages

sbussetti/pynzbdex

Folders and files

Latest commit

History

Repository files navigation

pynzbdex

Service Dependencies

Service Architecture

Assembly Strategies and Notes

UNIQUE KEY subj_key (subj_key),

addl custom

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

UNIQUE KEY `subj_key` (`subj_key`),

Packages