hep_filter is python script that intercepts daily abstract
announcements by arXiv.org,
calculates a score of each abstract according to keywords from
your scores file and than reorders abstracts so that the ones most
interesting to you come out on top. Resulting email is then forwarded to
you.
Download
Current version is hep_filter 0.6 [2007-08-24].
[download]
Installation
Firstly, download the hep_filter script somewhere and make it
executable. Then configure your scores file as explained below. Path to this
file should be set at the begining of the hep_filter script.
Note also that hep_filter mails resulting abstracts to you (address
specified at the beginning of the script) using your local SMTP server
"localhost". Edit hep_filter if you need to change this or if you
need to specify login and password.
Finally, you need to configure your email so that arXiv.org mailings get
redirected to hep_filter script. Ask your local email guru how to do
this. Let me just say that I use
procmail on Unix and the following
recipes in my .procmailrc file:
# If already hep_filtered hep-daily goes to INBOX for reading
:0:
* ^From.*hep_filter*
INBOX
# Otherwise filter it by hep_filter
:0
* ^From.*no-reply@arXiv.org*
| /path_to/hep_filter
Scores file
Syntax of this file is simple. Each line is of the form:
string -> score
Matrix theor% -> -3
orbifold -> -3
Witten -> +8
cosmological constant -> 1
revised version -> -5
Bell% inequalit% -> 3
All lines without "->" are ignored.
Percent sign is a wildcard matching anything until the end of the word.
You cannot use it in the middle of the word.
Pattern matching is case insensitive.
Matched strings will be enclosed within *-signs
in the abstracts. You can instruct your email reader to boldify
or colour these strings.
Testing
To see how it works without reconfiguring your email setup just save
one arXiv.org email to a file (preserving email headers!) and run a
"hep_filter < thisfile". SMTP server such as sendmail still needs to be
running.
Bugs
Patterns are recognized even in the middle of the word. So if you
don't want to read about Lie algebras and Lie derivatives and
score "Lie", script will also score words such as "implies".
|