jump to navigation

My big problem with WordPress May 13, 2006

Posted by curtmonash in Uncategorized.

Every piece of software has its glitches, but on the whole WordPress is a great system. Even so, there's one single reason that I look forward eagerly to the day I muster up the energy to port most or all of my sites away from WordPress: Single-parent category hierarchies.

I write a lot, and I don't really expect very many people to read my output in its entirety. Rather, I think they'd prefer to choose to take feeds in particular subject areas, hopefully broad ones.

Organizing my posts into such categories is really hard with a strictly hierarchical ontology. I need the multiple-parent ontology of Drupal.

Unfortunately, there's a lot else that seems to be missing or over-hard in Drupal. And it doesn't help that every stupid technical non-feature of the Computerworld blogs, when I blogged there, was blamed by the developers on "Drupal," deservedly or not.

I can’t make Pages work May 13, 2006

Posted by curtmonash in Uncategorized.
add a comment

On my main blogs, I can't create a Page. I apparently create one, but the URL doesn't work, even in the preview panel.

I can do it here, but not in those. So I imagine it has something to do with the theme, Sharepoint-like. I distinctly recall having this problem even before I ever hacked around with any customization of the theme.

Akismet lag May 13, 2006

Posted by curtmonash in Uncategorized.
add a comment

I had a case today in which Akismet let through several copies of the same spam, and blocked several other copies of the same spam on the same blog.

My guess (and I should have confirmed this by looking carefully at any available timestamps) is that it takes time to recognize a particular spam that's being sent out arouind the world, and I had the misfortune to be one of the "early adopters".

EDIT:  Nope.  Occasional copies of the same spam continue to get through, on the same blog and others.  Weird … 

A workaround that helps identify future spam May 7, 2006

Posted by curtmonash in Uncategorized.
add a comment

In a previous post I complained that if Akisment and my other filters let a bunch of spam comments through, I could mass delete them, but I couldn't mass-identify them as spam for Akismet's edification. (Uh, I'm assuming that I recall correctly that Akismet depends on being trained …)

Well, I've found a workaround that doesn't take too many steps.

1. Notice a common word to a set of spam that got through (there will usually be one; today's example was "vimax")

2. Add the word to the list of keywords triggering comment moderation in Options/Discussion. Update options. Then select the choice to refilter comments.

3. Move the word from the moderation list to the blacklist list. Update options again.

4. Go to Manage/Comments and classify all the newly moderated items as spam.

There are a few more clicks than I indicated above, but the whole process isn't too terrible. It would be a lot easier just to have a "Mark as spam" option separate from the "Delete" option in mass comment editing, of course, but this is something of a workaround.

54 more spam caught by Akismet May 4, 2006

Posted by curtmonash in Uncategorized.
add a comment

On DBMS2.com. That 150 comment limit for review really needs a workaround.

But accuracy was 100%.

Of course, there only were about three different spams in the bunch; the rest was a lot of repetition.

I hope the technology switch to Akismet isn't somehow making spammers MORE likely to hit me.  It's probably just coincidence; I don't have a clear mechanism in mind as to how the connection would work.

Early Akismet results May 3, 2006

Posted by curtmonash in Uncategorized.
add a comment

Over the past few days, Akismet caught 128 pieces of alleged comment spam on monashreport.com. One was a false positive — a trackback from my own blog, which is odd. The rest were real, generally lots of copies of the same thing.

It also let 7 copies of one spam through, but it's tough to fault it for that. It was a spam without a "call to action." Just the word "redrum" (read that backwards) a LOT. Maybe the bot posting it came from somebody unhappy that other spam was stopped? Who knows … Ironically, the post hit was my latest, which is one about spam. 🙂

On DBMS2.com, the figure is 187 alleged spam caught. The first 150 were indeed spam. I DON'T KNOW WHETHER THE OTHERS WERE, however, and to delete them blind.

The 150 item limit on reviewing spam — i.e., the lack of a "next page" — REALLY needs to be fixed.

Identifying comment spam April 28, 2006

Posted by curtmonash in Uncategorized.

It's no problem to make up a word list that catches half or more of all spam without a major false positives problem. Just think of the things people most commonly advertise through spam — porn, medication, loans. Porn and medication words are unlikely to give false positives, although loan words are more of a problem. And some of the worst offender sites can get added to the list manually.

Well, actually, on some of my blogs one might explicitly discuss the problem of spam, in which case any word could show up in the comments. Whoops.

Anyhow, that problem even aside, a large minority of spam can't easily be identify with keyword/keyphrase filters. They're just a bland friendly sentence or two (changing in form often enough that you can't catch even the majority of them with keyphrase filters, although I've gotten some mileage out of filtering on "nice blog" and the like). Their purpose is simply to put a URL onto your site.

And thus the key to filtering comment spam is, in even a more extreme form, the same as the key to filtering email spam — filter on the "call to action" (usually a URL). Whether it's to click on a URL, buy a stock, or whatever, almost all spammers want you to do something specific. The part of the spam in which they describe precisely what that is the part that is sufficiently invariant to make effective filtering possible.

I hope (and believe) that's part of what Aksimet is doing. But I gotta say this — based on the Monash Report, which is one of the blogs I've turned it on for, it definitely lets pretty obvious spam through. I mean, c'mon now — just how many comments from bettingonhorses.org is it going to take before Aksimet starts filtering them out??

On the plus side, Akismet does capture half or so of my spam so far, in a small sample size, and I've decided safe to turn on even on my busiest blog. The problem is that it tells you how much new spam there is, and also how much old spam, and doesn't make it clear whether it grabbed old "spam" just from your already-deleted-comments file.

However, a web search doesn't show people screaming about a real problem, so there probably isn't one. What's more, the UI text is a little more comforting when it has both old and new spam in the queue than just old spam. So, as I said, I've now turned it on even on my busiest blog.

By the way, that failed search for complaints did turn up a couple of interesting things. One is pretty much the original Akismet-popularizing blog post. The other contains a long list of "social engineering" comments appearing to be real, an all-too-high fraction of which are quite familiar to me already.

Different themes April 26, 2006

Posted by curtmonash in Uncategorized.
add a comment

One thing I’m using this blog for is to experiment with different themes. On my main blogs, set up in WordPress 1.5, I went with all the same theme — Sharepoint-like, which I lightly customized. Mainly, I selected it for its efficient use of space. And nested categories; at first, I wasn’t will to hack around that on my own. And I like the colors; blue/yellow/white is what decorated my room when I was growing up, and were major colors in my favorite New York City apartment as well.

But if I keep it I need to fiddle with the colors a little; that yellow text is pretty unreadable where used. And I’ve found I make a lot of long posts, so a theme that automatically truncated them on the first page (with suitable links to the whole thing) might be good for me. Like Hemingway, the theme I’m playing with here at the moment I write this. Another nice thing about Hemingway is that the About part seems to just be an editable page, rather than being hardwired into the code the way it is in Sharepoint-like. (I just deleted it altogether, put that info into initial blog posts, and then made one of my post categories “About This Blog”. Not the worst idea, if I may say so myself …) And yet another one is that there’s a convient place for the titles of recent posts, something that for some reason doesn’t wind up being in the themes I otherwise prefer.

No one theme ever exactly suits me. I foresee a little more hacking in my future …

WordPress and Akismet weirdness April 26, 2006

Posted by curtmonash in Uncategorized.
add a comment

I had no intention of starting yet another blog. However, in order to activate the much-needed Akismet antispam feature, I seemingly was forced to sign up for this account.

So it occurs to me — I might as well use it to document my dealings with WordPress and similar software, when I am in the mood to write on such subjects.

Come to think of it, listing the series of steps that led me here is too tedious. Suffice it to say that setup of the Aksimet plug-in is — well, it's weird.

One interesting thing is that I learned deleted old comments aren't really deleted. The "last 15 days of Aksimet spam" on DBMS2.com turned out to include 626 items, almost all of which I'd already deleted, most of which were more than 15 days old.

What's particularly weird is that the system helpfully suggests that since a mass delete can't be undone, I should check all 626 items to make sure I want to delete them — and then doesn't provide any visible way of actually checking more than the first 150 of them. That is, and I use the term advisedly, an interesting design choice.

Another odd design choice, although this may just be a matter of difficulty in coding, is this: There's no way to Mass Edit spam that Akismet missed, and mark it as spam. I currently have 20 spam comments that Aksimet missed on the blog. I can Mass Delete them in one go, or I can tell Akismet about all 20 of them, one after the other. Interesting choice on my part; just how public-minded am I?

Actually, I just went back and deactivated Akismet on DBMS2.com. A pity. I need something like that, and maybe after I find discussion forums on it I'll feel more secure with it and turn it back on, but at the moment it seems just too risky to use exactly where it is most needed — on an active blog getting inundated with comment spam.

But I left it turned on for now on the other three.

Mass Edit still REALLY needs the feature "Mark as Spam"