Email addresses mined from Thudfactor

Date April 22, 2008

If you are a commenter here, I owe you an apology. Yesterday I integrated the SezWho comment plugin which allows comment and reputation-tracking across weblogs. I assumed that only registered users of SezWho would be subject to this tracking. That assumption was incorrect.

When I activated the plugin, it transmitted six years worth of unencrypted commenter email addresses back to the SezWho main database so the SezWho application could begin to build a comment-dossier on those addresses.

In my programming career there have been several grand Ooops moments. There was the time I dumped a month’s worth of vitally important submitted document commentary for a major association’s big meeting. There was the time I accidentally spammed Capitol Hill. And then there was this moment.

I immediately contacted SezWho’s developers who have spent some time with me explaining how the system works. They have agreed to remove all Thudfactor-related content from their database, and I take them at their word on this. People who are mining addresses for nefarious spammy purposes don’t have working email addresses, much less trade email back and forth on application internals. I really think they are just plugin developers trying to solve a tricky problem and provide a valuable service.

They didn’t have to offer to remove the data or even respond to my email message. So I believe they are coding in good faith and aren’t going to go around and sell mailing lists behind my back.

But that’s only my impression, and it would have been much better to not have to rely on the kindness of strangers.

The moral of the story

The moral of the story is different if you are a site owner or just a commenter. I should have been more careful about installing this plugin. Apparently any plugin can access the database tables and phone home without any warning from Wordpress itself. In this case it was a plugin that explicitly collected and matched data on commenters, but it could just as easily have been something innocuous like a “random image” plugin. Indeed, plugins with injected malicious code have started turning up a number of places — usually someone else’s plugin with the evil code injected. The site 5ThirtyOne suggests you download your plugins from the original author’s web site but this will not necessarily protect you if the author him or herself is the person injecting the code. In this case the intent was not malicious, but it easily could have been.

For commenters, it’s a good reminder that you should have a disposable public address (or several) for spreading around and a private address for trusted communication. I know many of you already do this. But yes, every site you use your email address on is another chance for your address to be lifted. Even years after your last interaction.

And, unfortunately, even here at Thudfactor.

The plugin itself has been disabled and I want to re-iterate that SezWho says they are cleaning the database of the content I inadvertently sent them yesterday. I’m sorry, and I’ll do my best to start thinking of my database as a valuable collection of personal data rather than just my hobby.

6 Responses to “Email addresses mined from Thudfactor”

  1. Nick said:

    DAMMIT JOHN! God….

  2. Fred said:

    I haven’t seen a sudden increase in spam, but if I do, I’ll know who’s to blame. ;)

    In all seriousness, I do appreciate your letting us know. My concern isn’t so much this little mistake as it is my own use of Wordpress. I’m not sure I’d even notice malicious code in a plugin when I downloaded it.

  3. IshMEL said:

    There was the time I accidentally spammed Capitol Hill.

    Story! Story!

  4. Jitendra said:

    John,

    Thanks for an interesting conversation and for raising the issues…We did not use the md5 encoding that was used by Gravatar as it is not guaranteed to generate unique keys…But we will investigate other approaches here.

    In addition, I want to reiterate that the users on the board will not be getting any spam emails etc. so there is no cause for concern.

    John, I’ll ping you soon to work through the process of getting rid of the data and to explain how you can verify it.

    Thanks, Jitendra
    SezWho

  5. Nancy Cole said:

    Hi John,

    Sounds like a stressful day! If you’re still interested in adding comments, check out js-kit.com. We don’t track reputation of commenters. You will also find ratings, polls and scores there — all very easy to install.

    Best wishes,

    Nancy Cole
    js-kit.com

  6. some call him..... said:

    Meh. Stuff happens. Yeah, and your story of spamming Capital Hill was WAY cooler.

Leave a Reply

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>