The thoughts of a web 2.0 research fellow on all things in the technological sphere that capture his interest.

Thursday 18 March 2010

Welcome StumbleUpon - and other members of my recent spike

Unsurprisingly, my Webometric Thoughts aren't massively popular. There are few people who start the day checking the BBC, the Guardian, and then Webometric Thoughts. However, over the last few days my traffic has gone through the relative roof, from a steady 100 unique visitors a day, on Tuesday it leaped to 602!

Way beyond the previous high of 262. The reason: For a brief moment I was the TechCrunch pin-up boy, thanks to my (now-very-old) QR code T-shirt - nb. it goes without saying that this rather large company that clears $200,000 a month (according to Wikipedia) didn't bother asking my penniless permission.

What's particularly interesting is that hardly any of the traffic has come directly from TechCrunch, in fact only 112 of the visits over the last three days. Instead the traffic has been mostly a massive surge of visits to my home page from StumbleUpon. I'm not sure why, but nonetheless - Hello Stumbleupon Users *waves*

Labels: ,

posted by David at | 1 Comments

Tuesday 9 March 2010

How bad is Chatroulette?

Everywhere I turn at the moment there seems to be a story about Chatroulette.com. Press a button and you are in a random video chat with a stranger somewhere else in the world. Unsurprisingly it is painted as the latest sign of the world going to hell in a handcart: "Who will protect the children?"

As a particularly unsocial social media researcher I decided to do a quick quantitative study of first impressions of the people I came across on the site: clothed or naked/obscene, male or female. As I didn't particularly want to engage with anyone, but needed to put the web cam on to encourage the broadest cross-section, I set it up for Mr Shifter:

Results
Out of 100 web cams in which the subject was identifiable.
79% were men.
5 contained more than one man.
11 were obscene.
10% were female.
2 contained more than one woman.
1 was obscene.
2% were mixed sex groups
9% were objects
- mostly signs saying "show me you boobs".
In addition, I also came across one camera supposedly of a man who had just hung himself...I wasn't too sure where to place that one.

So what did I find out? The world is mostly just looking to talk, there's some weirdos out there, and one bloke who wanted to see the monkey dance...and was thrilled when he obliged.

Labels: ,

posted by David at | 0 Comments

Academic Search Engine Optimization: An inevitable evil?

The money available for public science is finite, and it is understandable that governments want to get value for public money spent, and show the value in the form of bibliometric and webometric indicators. Unfortunately scientists are far from perfect, and the indicators and metrics that are meant to reflect the merits of an academic's work can quickly become the focus of the academics work.

I've just finished reading Academic Search Engine Optimization (ASEO): Optimizing Scholarly Literature for Google Scholar & Co. (via @research_inform), which gives advice on making sure your journal articles are indexed and highly ranked by academic search engines (e.g., Google Scholar). There are numerous points I disagree with on both an ethical and a practical level:
  • "...tools that help in selecting the right keywords, Google Trends, Google Insights, Google Adwords"
  • "Synonyms of important keywords should also be mentioned a few times in the body of your text, so that the article may be found by someone who does not know the common terminology used in the research field."
When I write an academic paper my primary audience is academics in my specialised field, not the wider public that are likely to use different vocabulary and dominate services like Google Trends by their shear numbers. As an academic reading a paper I wouldn't appreciate the introduction of inconsistency and ambiguity through the use of synonyms, which are necessarily near-synonyms in the precise scientific world.
  • "..to achieve a good ranking in Google Scholar, many citations are essential. Google Scholar seems not to differentiate between self-citations and citations by third parties."
Self citation has always been rife and needs little encouragement. Later they state that "...any articles you have read that relate to your current research paper should be cited"; although surely discretion is an important factor unless we are going to shoe-horn in crap and further exaggerate the Mathew effect of the high ranked papers.
  • "...publish the article on the author's home page...an author who does not have a Web page might post the article on an institutional Web page"
Ignoring the curious turn of phrase, the general consensus is that the vast majority of academics should publish in their institutional repository irrespective of whether they have their own web site. The institutional repositories should have the procedures in place to ensure long-term archiving.
  • "...an article that includes outdated words might be replaced by either updating the existing article or publishing a new version on the author's web site."
As the authors acknowledge "...it may be considered misbehaviour by other researchers." At last we have a point we agree on.

As you have probably guessed from the above criticisms, I thought that the article was a piece of crap. Academic SEO should in no way effect how you write an academic paper, or the subjects we choose to write about. Unfortunately academic SEO is a topic that is likely to get a lot more attention amongst bad scientists if another practice I recently heard of takes off: Paying academics bonuses per article. A colleague told me last week how his former university had a pot of money from which academics were paid €4,000 (split between the number of authors) for articles published in certain 'quality' journals. It is a small step to start paying individuals for articles that reach a certain threshold of citations, at which point we will have finally dumbed-down science.

"Researchers need to think seriously about how to get their articles indexed by academic search engines" - No, they need to think seriously about doing worthwhile research and writing quality publications. If your focus is on SEO then you are in the wrong field.

Labels: , ,

posted by David at | 4 Comments

Friday 5 March 2010

A quick SPARQL of Dbpedia.org says I'm past it!

I've spent the last couple of days having a play around with some of the Linked Data that is increasingly being made available online - data that is made available through dereferencable URIs. One of the most interesting sources is Dbpedia.org, a project that extracts structured data from Wikipedia. Whilst it suffers from a lack of consistency, its crowd-sourced nature potentially offers unique insights into the nature of society (or at least the world as wikipedia users see it).

Today I downloaded a list of all the pages of people in dbpedia with dates of birth in the 20th century. Requests were sent using the SPARQL query language - with only one month requested at a time as dbpedia only provides the first 1,000 results for each query.

SELECT DISTINCT ?page ?dob {
?s foaf:page ?page.
?s ?dob .
Filter (?dob >= "1900-01-01"^^xsd:date) .
Filter (?dob <= "1900-01-31"^^xsd:date) . } Limit 1000


It's not particularly surprising to find that in the current celebrity obsessed world there are more wikipedia-famous people towards the end of the century than at the beginning, and that there are relatively few people under the age of twenty.

At 35 it would seem as though my best years for getting my own wikipedia page are behind me - although as I was never counting on my sporting prowess, there is probably still a chance.

The real power of Linked Data comes not from these data sets in isolation, but investigating how they link together...but you have to start somewhere.

Labels: , ,

posted by David at | 0 Comments

Thursday 4 March 2010

Microscopes and Micrographia

My home office is increasingly turning into a home lab: circuit boards, sensors, switches, wires, wire cutters, soldering iron, even a robot. My latest acquisition is a USB digital microscope with 200x magnification. I've been tempted by the thought of a USB microscope for a while, and whilst there are more powerful microscopes out there, at £29.99 it would have been churlish not to give this one a go.

Unbeknown to the Maplin's sales assistants, their sale was made that much easier by the fact I am currently reading Lisa Jardine's The Curious Life of Robert Hook. The man who through his Micrographia (1665) showed the world at large the hidden details they had never seen before. Painstaking drawing by hand the objects he placed under his slides.


Today the man on the street can pick a USB miscroscope of the shelf, and within minutes share his close-ups of the world. It remains to be seen however, whether it will encourge a generation of entomologists, or navel gazers.

Labels: , ,

posted by David at | 0 Comments