How Big Are Peoples' Computer Files? File Size Distributions Among User-managed Collections

by Jesse David Dinneen, et al.

Improving file management interfaces and optimising system performance require current data about users' digital collections, particularly about the file size distributions of such collections. However, prior work has examined only the sizes of system files and users' work files in varied contexts, and there has been no such study since 2013. It therefore remains unclear how today's file sizes are distributed, particularly for personal files, and whether distributions differ among the major operating systems or common occupations. Here we examine such differences among 49 million files in 348 user collections. We find that the average file size has grown more than ten-fold since the mid-2000s, though most files are still under 8 MB, and that demographic and technological factors influence the size distributions. We discuss the implications for user interfaces, system optimisation, and PIM research.

