2.4 The Vs

  • 3 Vs go back to Laney (2001)
    • Volume: e.g. Facebook’s data warehouse Hive, with 300 petabytes (300 million GB) of data in 800,000 tables. (Source)
    • Variety: Can you think of examples?
    • Velocity
      • e.g. Facebook generates 4 new petabyes of data per day
      • e.g. Facebook: 2700000000 Likes, 300000000 Photos per day (Source).. around 31250 Likes, 3472 Photos per second…
      • e.g. searches per second on Google (hard to estimate)
  • Additional Vs
    • Veracity/validity: Measurement quality/error (Monroe 2013, 5)
    • Value: What is the value of such data?
    • Vinculation:
      • to vinculate = “bind together”; emphasizes interdependent nature of social data (Monroe 2013, 4–5)

References

Laney, Doug. 2001. “3D Data Management: Controlling Data Volume, Velocity and Variety.” META Group Research Note 6 (70): 1.

Monroe, Burt L. 2013. “The Five Vs of Big Data Political Science Introduction to the Virtual Issue on Big Data in Political Science Political Analysis.” Polit. Anal. 21 (V5). Cambridge University Press: 1–9.