IDC forecasts that by 2025 the global datasphere will grow to 163ZB (that is a trillion gigabytes) which is also exactly ten times the 16.1ZB of data generated in 2016.
Data is all around us but at the moment, less than 0.5% of the data generated is categorised and analysed. Every second of every day we create new data. According to Forbes, for example, we perform 40,000 search queries every second on Google alone, which makes 3.5 million searches per day or 1.2 trillion every year
We are seeing and experiencing, and indeed as consumers, creating, huge volumes of analysable data every day, which at its most intrinsic level is categorised by the volume of words, emojis, reactions, images and video content we upload to social channels each day. Some of this is relevant to organisations, much isn’t, but finding the signals in the noise is the key to great data gathering and analytics.
Each and every day we ingest into our software and infrastructure millions of media items and add hundreds of millions of additional data points from:
Print: 30,000 items per day
Broadcast: 13,000 summaries per day
Online News: 1,200,000 items per day
Forums: 3,200,000 posts per day
Blog posts: 1,000,000 posts per day
Twitter: 4,500,000 tweets per day
Sina Weibo: 250,000 posts per day
YouTube: 6,000 new videos per day
Facebook Pages: 300,000 posts and comments per day
WeChat: 500,000 posts per day
The insights and research we create are always predicated on access to the best data and in this regard, which is why we source data from 4,400 print sources; 1,750 broadcast programs; 62,500 online news sites; 6 million blogs; 300,000 forums and a huge range of social networks including: Twitter, Facebook, Reddit, Pinterest, YouTube, Google+, Instagram, WeChat, Sina Weibo, Tencent QQ, and Tumblr.
In addition to our own content ingest, we partner with a number of global leaders in data gathering, for example Datasift in their development and interrogation of leading social media networks and channels.