A collection and filtering system for scientific papers and journal articles.
PDF files with a given name pattern are transformed to a searchable table of documents.
Sample directory structure:
.
├── pr
│ ├── Pattern recognition by means of disjoint principal components models - 1976 - 909c.pdf
│ ├── Pattern recognition by affine moment invariants - 1993 - 724c.pdf
│ └── 01_Learning hierarchical features for scene labeling - 2013 - 91c.pdf
├── segmentation
│ ├── 00_Object recognition from local scale-invariant features - 1999 - IEEE - 8369c.pdf
│ ├── Normalized Cuts and Image Segmentation - 2000 - 9351c.pdf
│ └── Comparing clusterings - an information based distance - 2007 - 510c.pdf
├── program.exe
└── readme.txt
The naming pattern of the PDF files has to be
[<rating>_]<title>[ - <year>][ - (<author>)][ - <publisher>][ - <number_of_citations>c].pdf
Where rating
ranges from top 00
to config_data['max_rating']
. This inverse naming scheme enhances readability in file listings such as ls.
A good estimate for the number of citations can be found at google scholar.