19 May 2010

The First Cameron Cabinet (Annotated; Final; Venn)

The Guardian's datablog published a breakdown of the cabinet, which has now been updated to include all 29 members and attendees. The Guardian also provided various information about each member, including sex, education, ethnicity etc. I'm hoping the data is now stable, so that this won't need to change again.

Here's an annotated version Miró's view of it on a 5-D nested Venn diagram, with yellow Lib Dems and true blue Tories.

cabinet-annotated.png

So on the main Venn Diagram we see that 19 (6 +8 + 4 + 1) of the 29 cabinet ministers so far announced are white males, educated at Oxbridge. Of those nineteen

  • six are under 45 and went to a private school;
  • eight are 45 or over and went to a private school;
  • one is under 45 and didn't go to a private school;
  • four are 45 and over and didn't go to a private school.

Of the remaining ten, six are white males.

The raw data is available from the Guardian Datastore; the original table I worked from (as presented by Miró) is here:

CabinetTableFinal.png

Although Miró did most of the work, I did annotation, so it's entirely possible that I have mislabelled the discs.

Labels: , , ,

13 May 2010

Cabinet Breakdown (improved)

Here's a slightly improved version of that breakdown of the new cabinet, with the natural colour coding (Lib Dems in yellow, Tories in blue).

cabinet-titled-party.png

Labels: , , ,

12 May 2010

The New Cabinet (so far): The Nested Venn Diagram

The Guardian's datablog has just published a breakdown of the cabinet, as announced so far, by various attributes such as sex, education, ethnicity etc. Here's Miró's view of it on a 5-D nested Venn diagram.

cabinet.png

So on the main Venn Diagram we see that 15 (1 + 5 + 5 + 4) of the 22 cabinet ministers so far announced are white males, educated at Oxbridge. Of those 15, 5 are under 45 and went to a private school, 5 are 45 or over and went to a private school, 1 is under 45 and didn't go to a private school and 4 are 45 and over and didn't go to a private school.

Of the remaining seven, five are male and white but didn't go to Oxbridge (1, privately schooled, all 45 or over).

The two who stand out as different are the two women, Baroness Sayeeda Warsi (lower right red disc), who is not male, white, Oxbridge-educated, privately schooled or 45 or over, and Theresa May (next lowest, next right-most red disc), who wasn't privately educated.

The raw data is available from the Guardian Datastore, but since it may be updated, the original table I worked from (as presented by Miró) is here:

Cabinet-Raw-Data.png

Labels: , , , ,

25 February 2010

Venn in Wonderland: Paper on Nested Venn Diagrams & Lewis Carroll Diagrams Available

I did a bit more work on those nested Venn Diagrams, including research that turned up an alternative to Venn Diagrams produced by Lewis Carroll, known naturally enough as Lewis Carroll Diagrams.
Carroll234.png
You may recognize the two-set version as the same structure used in the more recent "Boston Box".
BostonBox.png
It transpires that Carroll used nesting of his diagrams as far back as 1896. Here is his diagram for eight sets.
Carroll8.png
All this and more is described, with including the relationship to the Nested Venn Diagram I described in the previous post, in a paper entitled "Nested Venn Diagrams" from Stochastic Solutions.

Labels: , , ,

10 February 2010

The Nested Venn Diagram

Although this will be a long blog post, the essence of it is a single image, which I'm hoping is all you need to know. Here is the Big Idea, the Nested Venn Diagram:

GuardianTechVenn6small.png

If the picture is immediately self-explanatory, you need read no further; all else is mere elaboration, and I am a happy man. The six sets illustrated relate to the twitter users named (all members of the Guardian Technology's team) and the numbers in the intersections show the number of people they follow in common. At the centre, you will see that Jack Schofield (@jackschofield), Charles Arthur (@charlesarthur), Bobbie Johnson (@bobbiejohnson), Aleks Krotoski (@aleksk), Jemima Kiss (@jemimakiss), and Victor Keegan (@vickeegan) follow five users in common. Similarly, You can see that Aleks and Jemima follow six people who none of the men do, and that the men all follow two who neither of Aleks or Jemima do. (Note, this was as at 10th February 2009; obviously the following relationships may change.)

If you want to find out who they follow in common, tickery.net which lets you look at the intersection of any set of twitter users following relationships. (The links above use Tickery.) (Disclosure: Tickery is built by FluidInfo on its wonderful Fluid DB database; I am a shareholder in and advisor to Fluidinfo Limited.)

The Back Story

A client wanted, among other things, a Venn Diagram to show the which combinations of web sites a set of users visited. This presented two challenges. First, my software packages of choice (Miro and Klee), didn't technically support Venn Diagrams at the time of the request. That, however, was easily solved; after all, it's just code. The second problem was more serious. The number of websites he wanted to illustrate was not two or three, or even four, but six.

Six!

A six-dimensional Venn Diagram is a challenge. I had a vague recollection that no lesser person than Venn himself had come up with a construction that in principle allows an Venn Diagrams to be constructed in an arbitrary number of dimensions. But I also recalled that whenever I looked at such constructions, my head hurt. As Vic Reeves1 might say, well over 99 per cent of all Venn Diagrams in standard use show either two or three sets. I have seen four; I don't believe I have ever seen five used for anything other than explaining how to construct a five-dimensional Venn Diagram. If you're interested, here is Venn's constructions for five dimensions

500px-Venn5.png

which compares to my nested venn diagram construction,

NestedVenn5.png

and here is his construction for six sets

500px-Venn6.png

which compares to the nested Venn Diagram at the top of this article. (The images illustrating Venn's constructions were lifted from the Wikipedia article on Venn Diagrams, and were provided by Kopophex. Thanks, Kopophex.)

My solution, as you have probably gathered, is nesting. The image below shows all sixty-four possible memberships for six sets, which I have imaginatively labelled A through F. The large circles represent sets A, B and C; each of the small Venn Diagrams represents sets D, E and F. By placing a copy of the small Venn Diagram, in each of the eight positions corresponding to the various intersections of A, B, and C, we get a unique position on the diagram for each of the 64 combinations of set memberships for A, B, C, D, E and F. In case this isn't clear, here is a labelled version.

NestedVenn6Annotated.png

While this solution is far from perfect, so far the reaction from colleagues and others seems to have been positive. Certainly, I find this representation incomparably easier to digest than Venn's clever-but-extremly-difficult-to-read versions. And more significantly, on two occasions I have now gained insights from using these that I had previously failed to elecit from the data by alternative methods. I will follow-up, as time and clients permit, with some examples of their use.

Generalizing Further

It goes without saying that this technique can easily be generalized, to nesting copies of any n-set Venn Diagram in the various intersections of an N-set Venn diagram to yield a nested Venn Diagram in (n + N) dimensions (i.e for n + N sets). In principle, one could obviously go even further, nesting an arbitrary number of levels, but I have severe doubts about the utility of nesting more than once. I had thought that six was probably the largest number of dimensions (sets) the technique handles elegantly, but in fact have now implemented versions for seven and eight sets. Extending the Guardian Tech team to include its new member, Mercedes Bunz (sic; @MrsBunz), we get:

GuardianTechVenn7.png

and adding in Kevin Anderson @kevglobal, we get to:

GuardianTechVenn8.png

This obviously leaves just one question: who is the one person worthy of being followed by all eight (of these) Guardian Technology writers? You'll have to go to Tickery to find out.

Notes

1 for it was he, you will recall, who made the famous observation that 88.2% of statistics are made up on the spot.

Labels: , , , , , , , , ,