| Tweet |
How editorial data is collected, analysed and communicated in digital newsrooms.
About Contact Ask about the project Contribute to this research
“Thirty years ago we didn’t have computer-science departments; now every quality school on the planet has a CS department. Now nobody has a data-science department; in 30 years every school on the planet will have one,” WSJ, Big Data’s Big Problem: Little Talent
Big Data has been sexy for a little while now. Clearly, a lot of new data is being created (1.8 zettabytes this year alone) and in turn, a lot of jobs will spring up as a result (1.5 million additional managers and analysts in the United States alone)
But what I’ve been trying to figure out is where these trained employees are coming from - I’ve seen one class on data mining, but no clear major or direction of study producing these much needed skills. So I asked around - and got these results.
Personally, I think that these things, plus a genuine interest in the types of data being interpreted will introduce a class of smart and passionate interpreters. But until then, how do we make this a more commonly understood position?
| Tweet |
Defining data
Defining data can be a rather esoteric business, but for the purposes of this paper we will stick to the essentials. Data is typically viewed as a constituent of information - that is, the reduced, discrete entities that make up a larger system, set or understanding.
One of the cornerstones of non-fiction writer James Gleick’s enthralling book The Information: A History, A Theory A Flood is the thinking of Claude Shannon, the mathematician founder of Information Theory which views information as strings of bits which have quantities but which have nothing to do with meaning or values like truth or falsehood. [Gleick, J. (2011) The Information: A History, A Theory A Flood. Kindle. New York: Pantheon Books].
Many of the major scientific breakthroughs of the past century have been informational in nature. For instance the revelation that human biological substance is made of coded information patterns and that “our nerves carry messages” occurred only after the invention of the Telegraph, which carries human-made messages through wires. [Kelly, K. & Gleick, J. (2011) Why the Basis of the Universe Isn’t Matter or Energy—It’s Data. Wired.com/magazine [online]. Available from: http://www.wired.com/magazine/2011/02/mf_gleick_qa/all/1 (Accessed 2 April 2012). [online]].
More fundamentally information is, by many contemporary scientific accounts, the constitutional force behind life and the universe. [Landsburg, S. E. (2009) The Big Questions. London: Simon & Schuster.]
The Oxford English Dictionary defines data as a noun: “facts and statistics collected together for reference or analysis.” The singular of data is datum: “a piece of information.” These will be instructive moving forward.
Data in the World
Advances in information technologies are transforming how we communicate with friends and family, changing the way many enterprises do business and eroding the reliability and function of disparate organisational structures on which society and commerce depended for much of human history. [Shirky, C. (2008) Here Comes Everybody. New York: Penguin Group.] Rapid contractions in the price and size of consumer hardware necessary to interface with many of those information technologies have made the consumption and production of digital media both mobile and ubiquitous throughout much of the developed world.
Among the chief consequences of these transformations, as well as advances in industrial computing and interface design technologies, has been an explosion in the global volume of data.
So then, now that every person with a digital device - not to mention the places where they use and nearly any imaginable action performed on those devices - can be turned into a data point, how much data is there in the world?
In information disciplines data is measured in bytes. One byte is roughly the amount of information it takes to represent a single alphanumeric character.
Chief executive of enterprise software company Open Text, Mark Barrenechea, estimates that two and a half exabytes (one exabyte is a quintillion bytes) of computer data are produced every day, which means “every two days humans are putting more data online than what has been put into print since the dawn of recorded history.” [Simone, R. (2012) Power of big data will transform society, Canada 3.0 forum told. The Record [online]. Available from: http://www.therecord.com/news/business/article/711815—power-of-big-data-will-transform-society-canada-3-0-forum-tol (Accessed 28 April 2012).]
According to a 2012 report by the International Data Corporation (IDC), the amount of data available to the global Big Data industry is growing in volume by over 50 percent each year. [International Data Corporation (2012) Worldwide Big Data Technology and Services Forecast. [online]. Available from: http://www.idc.com/getdoc.jsp?containerId=prUS23355112 (Accessed April 16, 2012)].
All of that data, it seems prudent to remark, is being generated in one way or another by the growing population of Internet-connected human beings which surpassed the 2 billion mark near the start of 2011 and doubled in the five years before that. [ Lynn, J. (2010) Internet users to exceed 2 billion this year. http://www.reuters.com/. 19 October. [online]. Available from: http://www.reuters.com/article/2010/10/19/us-telecoms-internet-idUSTRE69I24720101019 (Accessed April 20, 2012)].
How manageable is all of that data? Not very if Eric Schmidt chief executive of Google - the Internet search giant, one of whose corporate missions is to “organise the world’s information - is to be believed. [ Corporate, G. (n.d.) Google’s mission is to organize the world’s information and make it universally accessible and useful. Google Company [online]. Available from: http://www.google.com/about/company/ (Accessed 28 April 2012). [online].
At the 2005 Association of National Advertisers conference, Schmidt estimated the company had indexed roughly 170 of the “5 million terabytes of information out in the world,” which is less than 0.004 percent. A terabyte is 10^12 bytes; ten terabytes is roughly the amount of information in the US Library of Congress’s entire catalogue. Without reference to the Internet’s exponential growth trends in the seven years since his talk (or sigmoidal growth, given infinite, exponential growth is likely impossible [Arthur, C. (2007) Want to impress your friends? Tell them internet growth is sigmoidal, not exponential. Guardian Technology Blog [online]. Available from: http://www.guardian.co.uk/technology/blog/2007/nov/26/wanttoimpressyourfriendst (Accessed 15 April 2012). [online].]), Schmidt predicted it could take Google 300 years to index the world’s data exhaustively and make it searchable. [Mills, E. (2005) Google reveals its 300-year plan. ZDNet IT Strategy [online]. Available from: http://www.zdnet.co.uk/news/it-strategy/2005/10/10/google-reveals-its-300-year-plan-39228011/ (Accessed 16 March 2012).]
In any event, the growth of the Internet is expanding at a blistering pace as measured by any number of factors including number of hosts, speed, and connected population. That growth, along with advances in peripheral technological and design innovations, is among the primary causes of the global swelling of collectible and usable data sets.
Adoption and Value
“Most great revolutions in science are preceded by revolutions in measurement. We have had a revolution in measurement in the last few years,” says economist and professor of data measurement Erik Brynjolfsson. “This revolution in measurement, which started with the switch from analogue to digital data, is as profound as the development of the microscope and what it did for biology and medicine.” [ Eric, B. (n.d.) Competing through Data. [online]. Available from: http://rss.mckinseyquarterly.com/fp/video (Accessed 10 April 2012). [online]. ]
The range of industries for which Big Data is providing applications and value is larger, however, than just biology and medicine.
“The march of quantification, made possible by enormous new sources of data, will sweep through academia, business and government. There is no area that is going to be untouched,” the director of Harvard’s Institute for Quantitative Social Science Gary King told the New York Times. [ Lohr, S. (2012) The Age of Big Data. News analysis on the New York Times Sunday Review [online]. Available from: http://www.nytimes.com/2012/02/12/sunday-review/big-datas-impact-in-the-world.html?pagewanted=all (Accessed 3 April 2012). [online]. ]
Similarly, research on technology and innovation by the McKinsey Global Institute and MGI asserts that “data have swept into every industry and business function and are now an important factor for production, alongside labor and capital. Accordingly, companies are starting to “leverage data-driven strategies to innovate, compete and capture value from deep and up-to-real-time information.” [ Manyika, J. et al. (2011) Big data: The next frontier for innovation, competition, and productivity. McKinsey Insights [online]. Available from: http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation (Accessed 28 April 2012). [online]. ]
A panel on Big Data at the World Economic Forum 2012 in Davos, Switzerland went so far as to declare data a new class of economic asset, similar to currency or gold. [ World Economic Forum (2012) Big Data, Big Impact: New Possibilities for International Development. [online]. Available from: http://www3.weforum.org/docs/WEF_TC_MFS_BigDataBigImpact_Briefing_2012.pdf (Accessed 1 April 2012). [online]. ]
What is it that make data so valuable and why are the insights they provide so highly prized?
For one, this revolution in measurement is intertwined with innovations in data collection and analysis and expedited by wide adoption of consumer-facing data-input and management platforms like Google, Facebook, Twitter and blogging services, all of which rely on digital data to function in the first place and “make it possible to measure behaviour and sentiment in fine detail and as it happens,” says Brynjolfsson.
Another, more directly commercial instance of data’s value is personalisation. Facebook is able to charge advertisers a higher premium than, say, digital news publishers in part because of how much specific demographic and behavioural data the company has about its users. The DVD rental and video streaming service Netflix has developed an algorithm that analyses data about demographics, location and film preference and processes that into personalised recommendation engines from which, by some estimates, 70 percent of titles are chosen. [ Simone, R. (2012) Power of big data will transform society, Canada 3.0 forum told. The Record [online]. Available from: http://www.therecord.com/news/business/article/711815—power-of-big-data-will-transform-society-canada-3-0-forum-tol (Accessed 28 April 2012). [online]].
In both cases knowing more fine-grained detail about customers increases companies’s ability to make evidence-based decisions about changes and refinements to products and services.
The lion’s share of Google’s revenues come from its ad model, which harnesses the value of data by analysing global search trends and auctioning off text-based ad space against keywords and phrases. Here, data plays several roles.
First, search trends are collected, analysed and made publicly available in the form of Google Analytics. (This platform has more function than just advertising, which we will come back to). This allows advertisers to research relative search volume for specific keywords (how many people are searching for what) and therefore provides business insights into what terms to associate with a product. This is a paradigmatic shift in the nature of advertising-led business models because, as opposed to picking a broad audience grouped around a publication, advertisers now buy visual real-estate around topics in which the consumer in question is known to have some level of interest. (That interest is expressed through the act of searching).
Secondly, click-through data about the ads themselves is collected and used to rank ads purchased against specific search terms. This prevents wealthy advertisers from bidding their way to the top of every high-traffic search term and instills “an economy of relevance and profit” into Google’s ad model. [Battelle, J. (2005) The Search: How Google and Its Rivals Rewrote the Rules of Business and Transformed Our Culture. New York: Penguin Group]. Data usage has brought forth a profitable advertising business model that doesn’t require a single salesperson.
Industries
“This is part of a broader revolution as we move from just financial and numerical data towards all sorts of non-financial metrics,” says Brynjolfsson. So the advantages of making data-based decisions as opposed to relying on “experience and intuition” are applicable to more industries than finance and web giants. [ Lohr, S. (2012) The Age of Big Data. News analysis on the New York Times Sunday Review [online]. Available from: http://www.nytimes.com/2012/02/12/sunday-review/big-datas-impact-in-the-world.html?pagewanted=all (Accessed 3 April 2012). ]
For example the McKinsey study found that with “intelligent, creative use of Big Data,” European Governments could make up €100 billion in operational efficiencies and United States healthcare could “create more than $300 billion in value every year.”
Retailers are processing information about sales, pricing, markets, demographics and weather to make informed decisions about which products to stock at what store locations and when to offer price reductions. Shipping companies are using local traffic and weather data to perfect their delivery routes. Police departments, starting with the NYPD, are learning to use historical arrest patterns, paydays, sporting events, rainfall and vacations to predict where crimes will happen when. Online dating services rely on algorithms to sift through the data in member profiles to look for personal traits and other factors that might allow them to better match couples.
Big Data is also advancing the level of sophistication in interdisciplinary academia and other varieties of institutional research. Although there are numerous incompatible formats in which data can exist, once data is digitised into machine-readable format - and much of the data being produced and worked with today is natively digital - the barriers to compatibility and collaboration break down quickly.
In 2011 for instance, assistant director of the Text and Digital Media Analytics department at the University of Illinois, Kalev Leetaru, used a supercomputer to analyse an archive of 100 million global news articles spanning 30 years. The result was a network of 10 billion people, places and things, and 100 trillion relationships which he used “to forecast the Arab Spring, pinpoint Bin Laden’s location,” and visualise the evolution of human society. [ Leetaru, K. (n.d.) Kalev H. Leetaru online homepage. Kelevleetaru.com [online]. Available from: http://www.kalevleetaru.com/ (Accessed 28 April 2012). [online].]
Future Prospects - Moving into News Production
Given some of the material we’ve already covered on the growth of the Internet and digital data, a somewhat strange picture of the future of commerce emerges. For instance, the top ten in-demand jobs globally in 2010 didn’t even exist in 2004. [ Fisch, K. et al. (2008) Did you know? [online]. Available from: http://www.youtube.com/watch?v=Mmz5qYbKsvM&feature=player_embedded (Accessed 28 April 2012). [online]. ]
Industries that previously had no interest in or use for data are now spending large portions of their budgets on leveraging the analysis of Big Data into meaningful business intelligence and insight. Other areas are transforming from their 20th century analogues into full-fledged data-centric businesses.
To be sure, industry and societal transformations on this scale, taking place at this speed, manufacture problems in education and training. The United States alone will face a shortage of 140,000-190,000 workers with “analytical expertise” and one and half million managers and analysts with the skills to understand and make decisions based on the analysis of Big Data. [ McKinsey Global Institute (2009) Big Data: The next frontier for competition. McKinsey & Company Features [online]. Available from: http://www.mckinsey.com/Features/Big_Data (Accessed 24 April 2012). ]
In the process of moving from a print-only world into a mixed media, digital-first or digital-only one, the journalism and publishing communities are as exposed to the costs of adapting to data-driven environments as the next industry.
The next section will…
| Tweet |
— danah boyd: Whether the digital era improves society is up to its users – that’s us (via greggyour)
(via infoneer-pulse)
| Tweet |
In the hearts of many newsman and woman lies a profound respect for a certain entity, the sublime sanctity of which has been enforced by centuries of hard-nosed proof-readers and dogmatic sub-editors. Since the first printed newspapers roughly a half century after the advent of the printing press in the mid 1400s [Johannes Weber (2006), Strassburg, 1605: The Origins of the Newspaper in Europe”, German History 24 (3): 387-412 (387)], professional structures in journalism production have evolved to foster attention to detail in the matters of spelling, punctuation, grammar, usage, mechanics and typography - in short, to ensure that writers follow house style. Journalists have learned to love and to cherish the part of their role that, for the benefit of society, has kept the written word pristine and effective in the names of clarity, consistency and professionalism.
“The newspaper industry developed a common style, maintained by the Associated Press, to meet the communication needs of a print-based industry trying to most effectively communicate with a broad audience,” says Robert Niles, long-time news site builder and founding editor of the award-winning service journalism website ThemeParkInsider.com. [Robert Niles, (April 21, 2010) http://www.ojr.org/, http://www.ojr.org/ojr/people/robert/201004/1843/, April 21, 2012]. The system seemed to work well.
Then one day in April 2010, a strange thing happened on a strange website called Twitter. The phrase ‘AP Stylebook’ hit the social networking and microblogging platform’s Trending Topics list after the news organisation announced it was changing its house style from ‘web site’ to ‘website’. [@APStylebook, (April 16, 2010) https://twitter.com/, http://twitter.com/#!/APStylebook/status/12296505018, April 21, 2012].
As with many debates whose central issue is reducible enough to accommodate Tweet-length weigh-ins and that have the popular appeal to make it onto Twitter’s Trending list, much of the discussion around the AP’s shift fell into the realm of pro - con commentary. “Everyone but me is cheering AP style change to website,” tweeted University of Florida journalism lecturer Mindy McAdams. “I think it resembles parasite.” [Mindy McAdams, (April 16, 2010) http://www.twylah.com/, http://www.twylah.com/macloo/tweets/12302712045, April 21, 2012].
After several days of the kind of digital discussion that sprawls from its inciting incident (in this case, the AP’s style change) to broader debates about what things mean in wider contexts, Niles published a blog post presenting his own take on the issue at hand. For Niles, the AP’s decision wasn’t about tradition or aesthetics, or really even about style in the sense that anyone but the most meticulous of copy-readers understands the word.
”If you’re publishing online, Google style (i.e. Search Engine Optimisation (SEO)) always trumps AP style,” was one of Niles’ tweeted conclusions. Another was: “J-schools need to ditch AP style and start teaching their students SEO instead. More valuable to their careers.” [Robert Niles, (April 21, 2010) Online Journalism Review, http://www.ojr.org/ojr/people/robert/201004/1843/, April 21, 2012].
What has changed in the world of journalism in recent years to allow Niles even the luxury of considering such a drastic abandon of the industry’s time-honoured stylistic standard? The short answer is that as the dominant media in which journalism is consumed changes from atom-based to bit-based, other things - like how readers find and consume news items and the nature of readers’ relationships to publications - changes as well.
Attention to SEO is, for example, helping writers and editors ensure their words reach the people for whom they’re written and intended. “SEO provides the key to reaching an audience not motivated by existing print brands, including younger readers and readers outside a publication’s traditional search area - folks who might not know to seek out a newspaper website, but who would nevertheless be interested in its content.” [Robert Niles, (April 21, 2010) Online Journalism Review, http://www.ojr.org/ojr/people/robert/201004/1843/, April 21, 2012].
Just as the bygone world of print-only journalism had its own techniques for increasing readership day-to-day (front page story, image and headline selection, supplements, giveaways, special offers), SEO offers digital publishers meaningful opportunities to increase the visibility of their product and to expand and retain audience in today’s search-driven media landscape: “Even as Facebook and social media provide an increasing share of referrals to online news sites, search engines still provide the initial point of entry for millions of new visitors to websites each day. If there are techniques that allow you to jump to the front of the line, to attract more of those potential readers, you need to be using them.” [Robert Niles, (April 21, 2010) Online Journalism Review, http://www.ojr.org/ojr/people/robert/201004/1843/, April 21, 2012].
The Daily Telegraph’s community manager Shane Richmond puts it another way: “The concept is simple. It’s about ensuring that your content is found by the millions of people everyday who use search engines as their first filter for news and those who don’t search at all but trust an automated aggregator, such as Google News, to filter stories for them. These people are essentially asking a computer to tell them the news. If you want your story to be read, you’d better make sure the computer knows what you’re writing about.” [Shane Richmond, (from British Journalism Review Vol 19, No 4, 2008) British Journalism Review, http://www.bjr.org.uk/data/2008/no4_richmond, April 21, 2012].
Moreover, the segment of the commercial online journalism sector looking to fund itself through digital display advertising is currently all but dependent on tactical SEO. “Good SEO can help make your pages more lucrative in keyword-targeted advertising systems, such as Google’s AdWords. Sloppy SEO leads to poorly matched ads, lower click-through rates and less money per click or impression,” says Niles. “If you need new readers to make money, then SEO will help you more than AP style.” [Robert Niles, (April 21, 2010)Online Journalism Review, http://www.ojr.org/ojr/people/robert/201004/1843/, April 21, 2012].
Of course, the dichotomy of AP and other house styles versus SEO is a false one - brought into existence by Niles and others in the heat of an industry debate as a tool to re-angle news analysis so it fits tidily into the “old versus new” narrative paradigm with which most readers are comfortable. Digital newsrooms today continue to maintain and respect house style guides while educating editorial staff-members on best SEO practice and hiring SEO editors whose jobs frequently entail manipulating content and page furniture to optimise for search engines.
Still, some degree of SEO skepticism on the part of established journalism-industry professionals is understandable. The AP Stylebook is so-labelled because it started out as a book; and even though it’s now been digitised, can be updated instantly and is responsive to reader feedback and recommendations, http://www.apstylebook.com is still a relatively static, slow-moving set of guidelines - which is one source of why it functions beautifully as reference material.
But pinning down best SEO practice is different. “Search companies don’t want their techniques to be stolen by rivals or exploited by publishers seeking to gain an unfair advantage. As a result, the rules of SEO are constantly changing as search engines refine their techniques both to improve results and to close loopholes. There are lots of factors at play in SEO.” [Shane Richmond, (from British Journalism Review Vol 19, No 4, 2008) British Journalism Review, http://www.bjr.org.uk/data/2008/no4_richmond, April 21, 2012]
Imagine coming from a world in which style reference material was written down, didn’t change often and was completely transparent. Now imagine someone tells you the new rules can be found in no one central place, depend on whims of companies (in this case, search engine developers) that have nothing to do with the core product you’re selling (in this case, news content) and change constantly based on factors you’re not allowed to know with any precision.
Another, perhaps more rigorous source of resistance to SEO comes from perceptions about editorial practice and priorities. “For many print journalists such tinkering amounts to butchery, and the insistence on keywords renders copy dry and formulaic. On top of this there’s a resistance,” says The Telegraph’s communities editor Shane Richmond, “to the idea that we are somehow reduced to writing for computers rather than for people.” [Shane Richmond, (from British Journalism Review Vol 19, No 4, 2008) British Journalism Review, http://www.bjr.org.uk/data/2008/no4_richmond, April 21, 2012].
Just as there is truth in this statement, there are a number of arguments and nuances to assuage concerns. First of all SEO practice changes almost constantly because the the scope of digital content is constantly expanding and the algorithms that determine the order of results for search terms are being tweaked regularly as well. Why do they change? According to Niles, “to better serve the needs of their users”. [Robert Niles, (April 21, 2010) Online Journalism Review, http://www.ojr.org/ojr/people/robert/201004/1843/, April 21, 2012].
Secondly, “there has always been a certain formula to journalism: focusing on the five Ws - who? what? why? when? and where? - that underpin any good story; writing an article so it can be cut from the bottom; working with the constraints of the publication’s style book -” and SEO can in fact be seen as a tool for strengthening some of these and for adapting and advancing others, notes Richmond. “A keyword-rich story will get names and places in prominent positions and will do so early in the copy. Just as clever headlines, delayed drops and other journalistic tricks evolved to suit the medium, so we will learn new ways to take advantage of the opportunities SEO provides to reach a vast audience.” [Shane Richmond, (from British Journalism Review Vol 19, No 4, 2008) British Journalism Review, http://www.bjr.org.uk/data/2008/no4_richmond, April 21, 2012].
Another aspect of SEO that makes some journalists uncomfortable is something Charlie Brooker suggested in his 2008 Guardian column, namely that writing with search in mind obliges journalists to think along the lines of memetic buzzwords and hyped phrases: “Your modern journalist is expected […] to shoehorn all manner of hot phraseology into copy […] to con people into reading it.” [Charlie Brooker, (July 21, 2008) The Guardian, http://www.guardian.co.uk/commentisfree/2008/jul/21/charliebrooker.pressandpublishing, April 21, 2012]. While this view holds some superficial logic, and indeed sway over some newsrooms, it ignores one of the more central principles that make search engines such widely useful entities - they establish industry-wide conventions about relevance and timeliness. “An irrelevant keyword does you no good at all and in some instances might be harmful.” [Shane Richmond, (from British Journalism Review Vol 19, No 4, 2008) British Journalism Review, http://www.bjr.org.uk/data/2008/no4_richmond, April 21, 2012].
The scope of these concerns though is limited compared to what I believe is the larger issue at hand: how do we live and do business in a world driven more than ever before by the collection, analysis and re-distribution of data? In the case of SEO, it’s a question of data pertaining to interest and topics - that is, what topics (in this case, search terms) are how many people looking for right now?
But “once we know what people are searching for should we write stories to meet that demand?” asks Richmond. “Will search engines end up dictating our news agenda as well as the way we format our stories? If we write stories simply to chase traffic, where do we find the resources to write the specialist stories, the ones that are important to our core readers but not massively popular?” [Shane Richmond, (from British Journalism Review Vol 19, No 4, 2008) British Journalism Review, http://www.bjr.org.uk/data/2008/no4_richmond, April 21, 2012].
Richmond’s answer is that these are editorial and not SEO questions. He writes that “there is nothing to debate when it comes to SEO”, that SEO is “value-neutral” and that “giving readers what they want is a sensible strategy, even though the overall mix of stories within a publication has to be balanced.”
”If an editor wants to devote resources to writing stories based on topics people are searching for, they now have the data that will permit them to do so.” The phasing here is key as in it lies the primary motivation for the research undertaken in this study.
The world today is filled with information far beyond the scope of search engine data for news sites. Digital media and the Internet in particular allows much of that data to be collected, organised, analysed and distributed in formats as arcane as binary code to media as polished as an advertising brochure. Accordingly, industries everywhere are beginning to look at how to use some of that data to increase efficiency, satisfy customers and for dozens of other purposes.
What are those purposes and how are they achieved in digital journalism today? Such is the question from which this research stems.
How do we collect, analyse, repurpose and then distribute data about the editorial and business of journalism?
And crucially, what are the effects of this process on journalistic output?
“Once we know what people are searching for should we write stories to meet that demand? Will search engines end up dictating our news agenda as well as the way we format our stories? If we write stories simply to chase traffic, where do we find the resources to write the specialist stories, the ones that are important to our core readers but not massively popular? // All those concerns are legitimate, but they are not questions about SEO and shouldn’t be interpreted as such. They are editorial questions. If an editor wants to devote resources to writing stories based on topics people are searching for, they now have the data that will permit them to do so. Giving readers what they want is a sensible strategy, even though the overall mix of stories within a publication has to be balanced. Different editors will make different choices, but they are editorial choices, not SEO choices. SEO is value-neutral. It doesn’t require you to dumb down, to fill your stories with the names of celebrities or to write 500 articles about Viagra every month. Even if you write about badgers, thermal dynamics or parachuting you will want your article to be seen by people who care about those topics. SEO techniques will give your article a better chance of being found.”
[source : http://www.bjr.org.uk/data/2008/no4_richmond ]
| Tweet |
This week, I resigned to lots and lots and lots of reading.
Exhaustive lists and noteage to follow but for now, I’m starting to see how the whole thing is going to focus.
One problem I’ve been toying with is the question of how to angle the paper so the title question reflects the most valuable insights in the paper.
Among the things I’m the most excited about is an insight about data and gamification - the argument loosely goes as follows:
- The importance and presence of data has become huge for enterprises and individuals and this applies to journalism and journalists too. (Furthermore, that role is growing - no trouble finding sources to mine and evidence to point to here).
- Because of a number of factors formalised in the fields of social, motivational, media and evolutionary psychology as well as in behavioural economics, any information about a system given to someone associated with that system (consumer, producer, designer, observer, etc) changes that person’s view of and relationship to the system in question.
- Consequently, gamification is just the most elaborately structured form of data provision - a buzzword that represents a kind of pinnacle in our formalised thinking about driving behaviour through engineering incentives for precise, discreet actions within data-reliant and data-driven systems.
- So in thinking about how newsrooms are using and will continue to use data to motivate behaviours and encourage the production of different kinds of content, it’s important to realise that the difference between gamifying a news production environment and giving reporters access to site analytics, even in formats as analogue as twice-daily printed reports, is one of degrees.
That is, there is a spectrum of which gamificaiton is one current extreme and onto which fall everything from geo-tagged, auto-bylined and auto-dated story meta-data, story comment counters, as well as all of the better-known metrics like hits, time on site, time on page, new visitors, etc.
- Past this realisation - supported, obviously, by primary and secondary literature - the role of this paper will be to look at specific news production environments, how content contributors are given access to various data and metrics and how that data is structured and delivered.
The task then is to assess, on a case-by-case basis, what the effects of that data expose have been, are at present and might evolve into in the future.
So obviously, the last bullet point gets into the case studies and analysis area of the paper, while the argument outlined in the four bullet points above will beed to come towards the end of the literature review - I think.
The lead up to this argument will be material about gamification, big data and some of the social sciences, mentioned above, that inform motivational and behavioural theory. This last though will be used more, I think, in the analysis where I can look at the particulars of what different newsrooms are doing and then analyse based on the breadth of literature I’ve been consuming in those fields.
| Tweet |
The purpose of this study is to look at the nascent data and gamification strategies being employed in news production environments to incent the production of news journalism. To the extent it is possible this paper also aims to reveal what the practical consequences of these strategies might be for the people producing news as well as for the nature of news output produced under such systems.
The study will proceed as follows.
Firstly reviewing what gamification means and how it can mean different things to different people in different situations and contexts will remove some of the blurry ambiguity of this buzz-word, the online presence of which Google Trends only acknowledges starting in 2010, and hopefully clear the way for more reasoned discussions moving forward. This will be accomplished through several real-world examples of gamification implementations in a range of industries for a range of desired results.
Secondly we will look at successful and unsuccessful examples of reader-facing gamification systems implemented by news-producing publications with digital presences. Throughout these minor case studies we will look to the fields of motivational psychology and social psychology, behavioural economics and theories of game mechanics - sometimes called game thinking - to try and understand what makes gamification so successful when it is in fact successful and what the limits to that effectiveness might be. This section will bring our discussion of why gamification has been deemed, and in many cases empirically shown to be, such an effective tool for motivating specific behaviours closer the realm of journalism and news production in particular, which is the focus of this study.
Next, a brief discussion of so-called “big data,” its presence in the lives of consumers, enterprises, society and culture will help to view gamification systems in light of the underlying collection and availability of data that makes such systems possible. This will lead to a discussion of data in the news production environment, how digital publications view the wealth of website and reader data available to them and why this matters on both abstract intellectual theoretical levels as well as in day-to-day news production practice.
Another brief section will examine the less-visible, though arguably faster-growing, area of enterprise gamification - that is gamifications of corporate workplace environments, often with the aim of increasing employee efficiency and satisfaction and of incentivising specific behaviours and tasks.
This will lead to a longer section on enterprise gamification at publications and in particular in daily news production environments. Following the earlier discussion about data in newsrooms however, this section will also consider how publications treat site analytics and reader data including how, in what forms and in what frequencies that data is made available to content producers both inside and outside the formal constructs of gamification.
Finally, the study will begin to examine the potential consequences firstly of exposing news content producers to specific site and reader metrics, and secondly of structuring those exposures into strategies with explicit aims in terms of influencing the production of news content.
| Tweet |
It may have started in 2002. After leaving the video game industry and receiving an MBA with distinction at Kingston University, twenty-year video game design veteran Nick Pelling had an idea. By combining business management strategy, game dynamics and computer programming Pelling believed he could integrate gaming interfaces with consumer electronics to make user experiences faster and more enjoyable. The next year Pelling started Conundra, a one-man consultancy dedicated to implementing game mechanics and immersive digital user interfaces.
The Conundra website consists of a single page with one contact link - for Pelling himself - and seven lines of text: “Conundra is a UK-based consultancy specialising in “Gamification”. This means: we help manufacturers evolve their electronic devices into entertainment platforms. We then design, build and run industry partner programmes around new collaborational business models. For these emerging platforms, we can also source, adapt or co-develop games and entertainments.” [Bolds and italics are Pelling’s]. This may have been the first use of the word gamification.
Three years later the company closed due to lack of customer interest. For his part Pelling writes on his meetup.com profile: “I invented gamification (though 7 or 8 years too early, *sigh*)”.
[sources:
http://www.youtube.com/watch?v=j7TllxZuZQc]
Today gamification is a word being used at conferences and tradeshows, in boardrooms and meeting rooms throughout the corporate world and beyond.
| Tweet |
The team at Bunchball was excited to see one of our own, Molly Kittle, named as one of the Top 10 Women in Gamification on Gamification.co. Molly Kittle, our Vice President of Digital Strategy, has been with Bunchball since the early days and has seen us grow from a young upstart to the…
| Tweet |
By Joseph Cole, Digital Strategy Producer, Bunchball
Joseph Cole, @joefcole
There’s a lot of excitement around working for an ad agency. Popular TV shows like Mad Men have really helped to drive the perceived glamour and sexiness of the ad agency world. It’s absolutely true. You do…
| Tweet |
Just as I re-settle into a new dissertation idea and start sweating for material, here comes Nieman Lab, ever fascinating, unmissable, thank you.
A fascinating look at Gawker’s newsroom by Nieman Lab’s Andrew Phelps.
In particular, the results of an experiment in which each staff writer spends one day a week on “traffic-whoring duty” while the rest pursue in-depth articles.
Gawker editor AJ Daulerio explained the experiment back in January:
This week, the writers of this site have all agreed to participate in an obnoxious, but worthwhile exercise. Each day, a different staff writer will be forced to break their usual routine and offer up posts they feel would garner the most traffic. While that writer struggles to find dancing cat videos and Burger King bathroom fights or any other post they feel will add those precious, precious new eyeballs, the rest of the staff will spend time on more substantive stories they may have neglected due to the rigors of scouring the internet each day to hit some imaginary quota. The writers not relegated to traffic-whoring duty will still post, just less frequently than many of them are probably used to.
Andrew Phelps, Nieman Lab. I can’t stop reading this analysis of Gawker’s editorial strategy.
| Tweet |
The New Yorker’s signature display typeface, used for its nameplate and headlines and the masthead above The Talk of the Town section, is Irvin, named after its creator, the designer-illustrator Rea Irvin.[17]
The body text of all articles in The New Yorker is set in Adobe Caslon.[18]
[from http://en.wikipedia.org/wiki/The_New_Yorker#cite_note-caslonref-17]
See also http://typesites.com/the-new-yorker/
| Tweet |


