- Dataset
- ROUGE
- pyquery (xml parsing)
- nltk (word tokenization)
- pandas - ashkonf_pagerank
- NetworkX (for textrank)
- DIVIDE DATA
- PARSER
- ...
- TEXTRANK
- ashkonf_pagerank
- timothyasp_pagerank
- wordentail (from class)
- distributedwordreps.py (from class)
#!/usr/bin/env python
# ------------------------------------
# Args:
[1] file.txt
[2] flag: [1, 3]
# EG: python example.py test.txt 1
# ------------------------------------
# Description ...
# ------------------------------------
import os import sys import csv import copy import random import pickle import itertools from operator import itemgetter from collections import defaultdict
import numpy as np import scipy import scipy.spatial.distance import sklearn.metrics from numpy.linalg import svd from collections import defaultdict
#!/bin/bash
Useful shit: http://kavita-ganesan.com/rouge-howto Check for a module's installation path: perldoc -l XML::DOM
TO RUN ROUGE: perl ROUGE-1.5.5.pl -e data -f A -a -x -s -m -2 -4 -u text_summarizer/settings.xml