BoI data breach: a sample customer notification

More on the Bank of Ireland 30,000-customer data breach (which is up to 31,500 people by now — BoI promised to contact the “affected” customers by post, warning them that their data had been leaked. If you were wondering what those letters might look like, wonder no more. Here’s one, via a friend who found himself in this unenviable position:

So it’s not just name, date of birth, and address — he notes that they’ve leaked ‘information on the current account I use to pay for the policy.’

Interestingly, he says that his life assurance policy was set up directly with their life assurance department, not via the local branch — which directly contradicts what BoI say on their website:

The laptops contained information relating to some customers who either obtained a quote or took out a Life Assurance policy with Bank of Ireland Life from the following branches: [... list of branches omitted...]

The update from 28 April doesn’t clarify this, either. Hmm.

Tags: , , , , , , ,

Comments (1)

CEAS

Spam: back from CEAS. The schedule with links to full papers is up, so anyone can go along and check ‘em out, if you’re curious.

Overall, it was pretty good — not as good as last year’s, but still pretty worthwhile. I didn’t find any of the talks to be quite up to the standards of last year’s TCP damping or Chung-Kwei papers; but the ‘hallway track’ was unbeatable ;)

Here’s my notes:

AOL’s introductory talk had some good figures; a Pew study reported that 41% of people check email first thing in morning, 40% have checked in the middle of the night, and 26% don’t go more than 2-3 days without checking mail. It also noted that URLs spimmed (spammed via IM) are not the same as URLs spammed — but the obfuscation techniques are the same; and they’re using 2 learning databases, per-user and global, and the ‘Report as Spam’ button feeds both.

Experiences with Greylisting: John Levine’s talk had some useful data — there are still senders that treat a 4xx SMTP response (temp fail) as 5xx (permanent fail), particularly after end of the DATA phase of the transaction, such as an ‘old version of Lotus Notes’; and there are some legit senders, such as Kodak’s mail-out systems, which regenerate the body in full on each send, even after a temp fail, so the body will look different. He found that less than 4% of real mail from real MTAs is delayed, and overall, 17% of his mail traffic was temp-failed. The 4% of nonspam that was delayed was delayed with peaks at 400 and 900 seconds between first tempfail and eventual delivery.

As usual, there were a variety of ‘antispam via social networks’ talks – there always are. Richard Clayton had a great point about all that: paraphrasing, I trust my friends and relatives on some things, and they are in my social networks — but I don’t trust their judgement of what is and is not spam. (If you’ve ever talked to your mother about how she always considers mails from Amazon to be spam, you’ll know what he means.)

Combating Spam through Legislation: A Comparative Analysis of US and European Approaches:
the EU ‘opt-in’ directive is now transposed everywhere in the EU; EU citizens who are spammed by a citizen from another EU country, the reports should be sent to the antispam authority in the sender’s country; and there’s something called ‘ECNSA’, an EU contact network of spam authorities, which sounds interesting (although ungoogleable).

Searching For John Doe: Finding Spammers and Phishers: MS’ antispam attorney, Aaron Kornblum, had a good talk discussing their recent court cases. Notably, he found one cases where an Austrian domain owner had set up a redirector site which sounded like it was expressly set up for spam use — news to me (and worrying).

A Game Theoretic Model of Spam E-Mailing: Ion Androutsopoulos gave a very interesting talk on a game theoretic approach to anti-spam — it was a little too complex for the time allotted, but I’d say the paper is worth a read.

Understanding How Spammers Steal Your E-Mail Address: An Analysis of the First Six Months of Data from Project Honey Pot: Matthew Prince of Project Honeypot had some excellent data in this talk; recommended. He’s found that there’s an exponential relationship between google Page Rank and spam received at scraped addresses, which matches with my theory of how scrapers work; and that only 3.2% of address-harvesting IPs are in proxy/zombie lists compared to 14% of spam SMTP delivery IPs. (BTW, my theory is that address scraping generally uses Google search results as a seed, which explains the former.)

Computers beat Humans at Single Character Recognition in Reading based Human Interaction Proofs (HIPs): this presented some great demonstrations of how a neural network can be used to solve HIPs (aka CAPTCHAs) automatically. However, I’m unsure how useful this data is, given that the NN required 90000 training characters to achieve the accuracy levels noted in the paper; unless the attacker has access to their own copy of the HIP implementation they can run themselves, they’d have to spend months performing HIPs to train it, before an attack is viable.

Throttling Outgoing SPAM for Webmail Services: cites Goodman in ACM E-Commerce 2004 as saying that ESP webmail services are a ’substantial source of spam’, which was news to me! (less than 1% of spam corpora, I’d guess). It then discusses requiring the submitter of email via an ESP webmail system to perform a hashcash-style proof-of-work before their message is delivered. By using a Bayesian spam filter to classify submitted messages, the ESP can cause spammers to perform more work than non-spammers, thereby reducing their throughput. Didn’t strike me as particularly useful — Yahoo!’s Miles Libbey got right to the heart of the matter, asking if they’d considered a situation where spammers have access to more than one computer; they had not. A better paper for this situation would be Alan Judge’s USENIX LISA 2003 one which discusses more industry-standard rate-limiting techniques.

SMTP Path Analysis: IBM Research’s anti-spam team discuss something very similar to several techniques used in SpamAssassin; our versions have been around for a while, such as the auto-whitelist (which tracks the submitter’s IP address rounded to the nearest /16 boundary), since 2001 or 2002, and the Bayes tweaks we added from bug 2384, back in 2003.

Naive Bayes Spam Filtering Using Word-Position-Based Attributes: an interesting tweak to Bayesian classification using a ‘distance from start’ metric for the tokens in a message. Worth trying out for Bayesian-style filters, I think.

Good Word Attacks on Statistical Spam Filters: not so exciting. A bit of a rehash of several other papers — jgc’s talk at the MIT conference on attacking a Bayesian-style spam filter, the previous year’s CEAS paper on using a selection of good words from the SpamBayes guys, and it entirely missed something we found in our own tech report — that effective attacks will result in poisoned training data, with a significant bias towards false positives. In my opinion, the latter is a big issue that needs more investigation.

Stopping Outgoing Spam by Examining Incoming Server Logs: Richard Clayton’s talk. Well worth a read. It’s an interesting technique for ISPs — detecting outgoing spam by monitoring hits to your MX from your own dialup pools which uses known ratware patterns.

Tags: , , , , , , , , ,

Comments

Kentucky sez ‘Opt-Out Still Doesn’t Work’

Spam: Some fantastic data in this paper from the Kentucky Long-Term Policy Research Center.

It’s a brief 2-pager detailing the effectiveness of the CAN-SPAM Act in reducing the spam load, using a set of test addresses. The methodology is pretty good.

One point in particular is very important: ‘opting out’ from spam Just Does Not Work. This graph tells the whole story:

After opting out from spams received, the amount of spam received at those ‘opted out’ test addresses actually rose. (This even after CAN-SPAM made such activity explicitly illegal.)

Some other data:

  • obfuscating addresses on web pages is still working; 7.7 times the spam is received if you don’t bother doing so.
  • e-mail harvesting also continues after CAN-SPAM made it illegal.

If anyone needed proof, this shows that spammers are quite happy to break the law; strong enforcement ‘teeth’ are needed for any anti-spam legislation. (UK, take note: the thoroughly useless system whereby spam complaints must be submitted on paper isn’t going to help!)

The Technical Details document also notes something interesting: one test address was set up to test ‘opting out’ of legitimate mass mail from some (unnamed) big websites, and continued to receive ads ’sometimes months after opting-out’. For shame!

(thx to John Levine for forwarding the links.)

Spam: Michael Radwin on open HTTP redirectors, and in particular noting that Yahoo! have (finally) closed their main one down. One down, several hundred to go ;)

Good history of the exploitation techniques that spammers have been using, too.

Tags: , , , , , , , , ,

Comments

Secsed-up

Humour: Data::Secs2 — canoncial string for nested data. A format for representing nested data structures in accordance with SEMI E5-94, Semiconductor Equipment Communications Standard 2 (SECS-II), apparently pronounced “’sex two’ with gusto and a perverted smile.”

The manual page goes on:

In order not to plagarize college students, credit must be given where credit is due. Tony Blair, when he was a college intern at Intel Fab 4, in London invented the SEMI SECS standards. When the Intel Fab 4 management discovered Tony’s secsification of their host and equipment, they elected to have security to escort Tony out the door. This was Mr. Blair’s introduction to elections which he leverage into being elected prime minister. In this new position he used the skills he learned at the Intel fab to secsify intelligence reports on Iraq’s weopons of mass distruction.

‘Secsed-up’, surely!?

Tags: , , , , , , , , , ,

Comments

RSA, Z/Yen report open WiFi hot-spots used to send spam

Well, this is bad news. It seems one of the biggest bugbears for open Wifi hot-spots, ‘what if it’s used to spam’, may now be happening on a wide scale…

Unauthorized WLAN Connections Used to Send Spam (2 April 2003)

Data gathered from a wireless LAN (WLAN) honeypot showed that nearly 75% of intentional unauthorized connections made were used to send spam. (newsfactor.com)

The honeypots were set up in the City of London for 2 weeks, as default, open WLANs. This is the nearest I can come to a source. Both RSA Security UK and Z/Yen don’t list it on their press releases pages.

My thoughts: it could be the Jeem or Rewt spam-relaying trojans searching for open nets automatically, from infected machines. Strikes me that there wouldn’t be too many spammers war-driving around London, in person.

Thanks to Tony Earnshaw for forwarding it on from SANS NewsBytes…

Date: 09 Apr 2003 19:57:32 +0200
From: Tony Earnshaw (spam-protected)
To: (spam-protected)
Subject: SANS Newsbytes for today

SANS stuff is always interesting; those who care about their network and computer security should really subscribe - not to mention the SANS GIAC stuff.

The undermentioned is interesting to SA Talk.

– Unauthorized WLAN Connections Used to Send Spam (2 April 2003) Data gathered from a wireless LAN (WLAN) honeypot showed that nearly 75% of intentional unauthorized connections made were used to send spam.

http://www.newsfactor.com/perl/story/21168.html

Tony

Tony Earnshaw

e-post
tonniatbillydotdemondotnl
www

http://www.billy.demon.nl


This SF.net email is sponsored by: Etnus, makers of TotalView, The debugger for complex code. Debugging C/C++ programs can leave you feeling lost and disoriented. TotalView can help you find your way. Available on major UNIX and Linux platforms. Try it free. www.etnus.com

Spamassassin-talk mailing list (spam-protected) https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Tags: , , , , , , , , ,

Comments (1)

Proposed Irish data retention laws

Karlin notes this about ‘the extraordinary letter the Department of Justice sent out this week to various parties’.

According to the letter, the Department will hold a preliminary forum to ‘initiate’ a consultation process on its proposed three-year data retention bill … The forum begins at 3pm — clearly making sure no long and unruly discussions will develop! — and starts with a 20-minute address by the Minister, followed by a 20-minute address by the Dept of Communications on the 1997 EU Data Privacy Directive (which, BTW, Ireland STILL has not implemented despite being under legal threat by the EU — and note that there’s no mention of the far more crucial 2002 amended Directive, voted in last May by a spineless and ill-informed EU Parliament, which allows for up to SEVEN YEARS data retention.

Then — and this is the amazing bit — attendees get a 20 minute pep talk by An Garda Siochana (the Irish police force) ‘on the contribution of data retention in the fight against crime.’

When you pick yourself up off the floor, remind yourself that this is the Irish government’s formal initiation of a purported public discussion on data retention — brought to you by the Irish police. Amazing. You’d have thought they’d at least *pretend* to be balanced and disinterested, and perhaps ask Joe Meade, the Irish Data Protection Commissioner, to contribute as well. …

The Department of Justice itself should have nothing whatsoever to do with ANY consultation process on this proposed bill. Instead, as in the UK, an independent Dail group should hold hearings and get public input into this.

Tags: , , , , , , , , ,

Comments