BIG, small or Right Data:

Ricardo Baeza-Yates
7 min readJun 19, 2020

Which is the proper focus?

We find ourselves in the era of big data, where vast, continuous streams of heterogeneous human-related data are collected by digital means, and simplified for consumption according to the 5V characterization: Volume (size of the data), Variety (diversity of the content), Velocity (the rate it’s produced), Veracity (the quality of the content) and Value (it’s business impact). These humongous data sets are collected via many different means including computer networks, social media profiles, web browsing histories, mobile phone sensors…

--

--

Ricardo Baeza-Yates

World-class expert on data science and algorithms. Director of Research at EAI Northeastern University. Former VP of Research at Yahoo Labs. ACM & IEEE Fellow.