I am using SpamAssassin 3.1.0 with Perl 5.8.6 on a Fedora Linux box.
When I feed sa-learn a new spam message, I sometimes get the following
report:
Learned tokens from 0 message(s) (0 message(s) examined)
Shouldn't I see some "real" numbers here, like 1 and 1?
The command I am using is:
sa-learn -D --showdots --mbox --spam $FILE
Is this normal? Should I be concerned?
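One thing that may be worth ruling out (an assumption, not something the debug output confirms): with --mbox, sa-learn expects mbox-format input, in which every message begins with a "From " separator line. If $FILE holds a single raw message without that line, it is plausible that no messages are found at all, which would match "0 message(s) examined". A minimal sketch for counting the separators; the function name count_mbox_messages is made up for illustration:

```shell
# Count the mbox "From " separator lines in a file.
# A result of 0 suggests the file is not in mbox format.
# (count_mbox_messages is a hypothetical helper, not part of SpamAssassin.)
count_mbox_messages() {
    # grep -c prints the number of matching lines; it exits non-zero
    # when the count is 0, so "|| true" keeps the function's status clean.
    grep -c '^From ' "$1" || true
}

# Possible usage before learning:
#   count_mbox_messages "$FILE"
#   sa-learn --showdots --mbox --spam "$FILE"
```

If the count is 0, running sa-learn on the same file without --mbox would be the obvious thing to try.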
Skip
Just to be complete, here is a sample debug output:
[26075] dbg: logger: adding facilities: all
[26075] dbg: logger: logging level is DBG
[26075] dbg: generic: SpamAssassin version 3.1.0
[26075] dbg: config: score set 0 chosen.
[26075] dbg: util: running in taint mode? yes
[26075] dbg: util: taint mode: deleting unsafe environment variables,
resetting PATH
[26075] dbg: util: PATH included '/usr/bin', keeping
[26075] dbg: util: PATH included '/bin', keeping
[26075] dbg: util: final PATH set to: /usr/bin:/bin
[26075] dbg: dns: is Net::DNS::Resolver available? yes
[26075] dbg: dns: Net::DNS version: 0.49
[26075] dbg: dns: name server: 192.168.1.10, family: 2, ipv6: 0
[26075] dbg: config: using "/etc/mail/spamassassin" for site rules pre
files
[26075] dbg: config: read file /etc/mail/spamassassin/init.pre
[26075] dbg: config: read file /etc/mail/spamassassin/v310.pre
[26075] dbg: config: using "/usr/share/spamassassin" for sys rules pre
files
[26075] dbg: config: using "/usr/share/spamassassin" for default rules
dir
[26075] dbg: config: read file /usr/share/spamassassin/10_misc.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_advance_fee.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_anti_ratware.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_body_tests.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_compensate.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_dnsbl_tests.cf
[26075] dbg: config: read file /usr/share/spamassassin/20_drugs.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_fake_helo_tests.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_head_tests.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_html_tests.cf
[26075] dbg: config: read file
/usr/share/spamassassin/20_meta_tests.cf
[26075] dbg: config: read file /usr/share/spamassassin/20_net_tests.cf
[26075] dbg: config: read file /usr/share/spamassassin/20_phrases.cf
[26075] dbg: config: read file /usr/share/spamassassin/20_porn.cf
[26075] dbg: config: read file /usr/share/spamassassin/20_ratware.cf
[26075] dbg: config: read file /usr/share/spamassassin/20_uri_tests.cf
[26075] dbg: config: read file /usr/share/spamassassin/23_bayes.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_accessdb.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_antivirus.cf
[26075] dbg: config: read file
/usr/share/spamassassin/25_body_tests_es.cf
[26075] dbg: config: read file
/usr/share/spamassassin/25_body_tests_pl.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_dcc.cf
[26075] dbg: config: read file
/usr/share/spamassassin/25_domainkeys.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_hashcash.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_pyzor.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_razor2.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_replace.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_spf.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_textcat.cf
[26075] dbg: config: read file /usr/share/spamassassin/25_uribl.cf
[26075] dbg: config: read file /usr/share/spamassassin/30_text_de.cf
[26075] dbg: config: read file /usr/share/spamassassin/30_text_fr.cf
[26075] dbg: config: read file /usr/share/spamassassin/30_text_it.cf
[26075] dbg: config: read file /usr/share/spamassassin/30_text_nl.cf
[26075] dbg: config: read file /usr/share/spamassassin/30_text_pl.cf
[26075] dbg: config: read file
/usr/share/spamassassin/30_text_pt_br.cf
[26075] dbg: config: read file /usr/share/spamassassin/50_scores.cf
[26075] dbg: config: read file /usr/share/spamassassin/60_awl.cf
[26075] dbg: config: read file /usr/share/spamassassin/60_whitelist.cf
[26075] dbg: config: read file
/usr/share/spamassassin/60_whitelist_spf.cf
[26075] dbg: config: read file
/usr/share/spamassassin/60_whitelist_subject.cf
[26075] dbg: config: using "/etc/mail/spamassassin" for site rules dir
[26075] dbg: config: read file /etc/mail/spamassassin/local.cf
[26075] dbg: config: using "/home/skip/.spamassassin/user_prefs" for
user prefs file
[26075] dbg: config: read file /home/skip/.spamassassin/user_prefs
[26075] dbg: plugin: loading Mail::SpamAssassin::Plugin::URIDNSBL from
@INC
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0xa46411c)
[26075] dbg: plugin: loading Mail::SpamAssassin::Plugin::Hashcash from
@INC
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::Hashcash=HASH(0xa483478)
[26075] dbg: plugin: loading Mail::SpamAssassin::Plugin::SPF from @INC
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::SPF=HASH(0xa4a359c)
[26075] dbg: plugin: loading Mail::SpamAssassin::Plugin::Pyzor from
@INC
[26075] dbg: pyzor: network tests on, attempting Pyzor
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::Pyzor=HASH(0xa4b774c)
[26075] dbg: plugin: loading Mail::SpamAssassin::Plugin::SpamCop from
@INC
[26075] dbg: reporter: network tests on, attempting SpamCop
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::SpamCop=HASH(0xa5315a0)
[26075] dbg: plugin: loading Mail::SpamAssassin::Plugin::AWL from @INC
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::AWL=HASH(0xa529e0c)
[26075] dbg: plugin: loading
Mail::SpamAssassin::Plugin::AutoLearnThreshold from @INC
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::AutoLearnThreshold=HASH(0xa6ec7ec)
[26075] dbg: plugin: loading
Mail::SpamAssassin::Plugin::WhiteListSubject from @INC
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0xa6f5880)
[26075] dbg: plugin: loading Mail::SpamAssassin::Plugin::MIMEHeader
from @INC
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::MIMEHeader=HASH(0xa6ffaf4)
[26075] dbg: plugin: loading Mail::SpamAssassin::Plugin::ReplaceTags
from @INC
[26075] dbg: plugin: registered
Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0xa709174)
[26075] dbg: config: adding redirector regex:
/^http:\/\/chkpt\.zdnet\.com\/chkpt\/\w+\/(.*)$/i
[26075] dbg: config: adding redirector regex:
/^http:\/\/www(?:\d+)?\.nate\.com\/r\/\w+\/(.*)$/i
[26075] dbg: config: adding redirector regex:
/^http:\/\/.+\.gov\/(?:.*\/)?externalLink\.jhtml\?.*url=(.*?)(?:&.*)?$/i
[26075] dbg: config: adding redirector regex:
/^http:\/\/redir\.internet\.com\/.+?\/.+?\/(.*)$/i
[26075] dbg: config: adding redirector regex:
/^http:\/\/(?:.*?\.)?adtech\.de\/.*(?:;|\|)link=(.*?)(?:;|$)/i
[26075] dbg: config: adding redirector regex:
m'^http.*?/redirect\.php\?.*(?<=[?&])goto=(.*?)(?:$|[&\#])'i
[26075] dbg: config: adding redirector regex:
m'^https?:/*(?:[^/]+\.)?emf\d\.com/r\.cfm.*?&r=(.*)'i
[26075] dbg: plugin:
Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0xa709174) implements
'finish_parsing_end'
[26075] dbg: replacetags: replacing tags
[26075] dbg: replacetags: done replacing tags
[26075] dbg: bayes: tie-ing to DB file R/O
/home/skip/.spamassassin/bayes_toks
[26075] dbg: bayes: tie-ing to DB file R/O
/home/skip/.spamassassin/bayes_seen
[26075] dbg: bayes: found bayes db version 3
[26075] dbg: bayes: DB journal sync: last sync: 1138192116
[26075] dbg: config: score set 3 chosen.
[26075] dbg: learn: initializing learner
[26075] dbg: bayes: bayes journal sync starting
[26075] dbg: locker: safe_lock: created
/home/skip/.spamassassin/bayes.lock.pelorus.26075
[26075] dbg: locker: safe_lock: trying to get lock on
/home/skip/.spamassassin/bayes with 0 retries
[26075] dbg: locker: safe_lock: link to
/home/skip/.spamassassin/bayes.lock: link ok
[26075] dbg: bayes: tie-ing to DB file R/W
/home/skip/.spamassassin/bayes_toks
[26075] dbg: bayes: tie-ing to DB file R/W
/home/skip/.spamassassin/bayes_seen
[26075] dbg: bayes: found bayes db version 3
[26075] dbg: locker: refresh_lock: refresh
/home/skip/.spamassassin/bayes.lock
[26075] dbg: locker: refresh_lock: refresh
/home/skip/.spamassassin/bayes.lock
[26075] dbg: bayes: synced databases from journal in 0 seconds: 1045
unique entries (1630 total entries)
[26075] dbg: bayes: bayes journal sync completed
[26075] dbg: bayes: expiry starting
[26075] dbg: locker: refresh_lock: refresh
/home/skip/.spamassassin/bayes.lock
[26075] dbg: locker: refresh_lock: refresh
/home/skip/.spamassassin/bayes.lock
[26075] dbg: bayes: DB expiry: tokens in DB: 95467, Expiry max size:
150000, Oldest atime: 1098802920, Newest atime: 1138269721, Last
expire: 0, Current time: 1138273502
[26075] dbg: bayes: expiry completed
[26075] dbg: bayes: untie-ing
[26075] dbg: bayes: untie-ing db_toks
[26075] dbg: bayes: untie-ing db_seen
[26075] dbg: bayes: files locked, now unlocking lock
[26075] dbg: locker: safe_unlock: unlink
/home/skip/.spamassassin/bayes.lock
Learned tokens from 0 message(s) (0 message(s) examined)
I have one other strange problem that perhaps someone could help me
with. When my mail is piped through spamassassin via procmail, the
subject of spam messages is not changed (i.e., "[SPAM]" is not added to
the front of the subject). But if I run a message through spamassassin
by hand (for example, with the "-D" option), the subject line is
modified as I would expect. Via procmail the other headers are added;
only the subject line is left alone. I have the "report_safe 0" and
"rewrite_header Subject [SPAM]" options in my local.cf file, and, as I
said, "-D" reports that it is reading the file. Any ideas why the two
behave differently?
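To narrow this down, it may help to compare the Subject header produced by the two paths directly. The sketch below is only a helper for that comparison: the function name subject_of and the file names in the usage comments are made up for illustration, and it assumes the Subject header is not folded across lines.

```shell
# Print the Subject: header line of a single mail message read from stdin.
# Reading stops at the first blank line, so body text is never matched.
# (subject_of is a hypothetical helper, not part of SpamAssassin.)
subject_of() {
    awk '/^$/ { exit } /^Subject:/ { print }'
}

# Possible usage, assuming msg.txt holds one raw spam message and
# delivered.txt is the copy of it that procmail delivered:
#   spamassassin < msg.txt | subject_of
#   subject_of < delivered.txt
```

If the first command shows "[SPAM]" and the second does not, the procmail path is presumably invoking something that reads a different configuration than the command-line run does.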
Skip
