Notes on a New Philosophy of Empirical Science

04/28/2011
by   Daniel Burfoot, et al.
0

This book presents a methodology and philosophy of empirical science based on large scale lossless data compression. In this view a theory is scientific if it can be used to build a data compression program, and it is valuable if it can compress a standard benchmark database to a small size, taking into account the length of the compressor itself. This methodology therefore includes an Occam principle as well as a solution to the problem of demarcation. Because of the fundamental difficulty of lossless compression, this type of research must be empirical in nature: compression can only be achieved by discovering and characterizing empirical regularities in the data. Because of this, the philosophy provides a way to reformulate fields such as computer vision and computational linguistics as empirical sciences: the former by attempting to compress databases of natural images, the latter by attempting to compress large text databases. The book argues that the rigor and objectivity of the compression principle should set the stage for systematic progress in these fields. The argument is especially strong in the context of computer vision, which is plagued by chronic problems of evaluation. The book also considers the field of machine learning. Here the traditional approach requires that the models proposed to solve learning problems be extremely simple, in order to avoid overfitting. However, the world may contain intrinsically complex phenomena, which would require complex models to understand. The compression philosophy can justify complex models because of the large quantity of data being modeled (if the target database is 100 Gb, it is easy to justify a 10 Mb model). The complex models and abstractions learned on the basis of the raw data (images, language, etc) can then be reused to solve any specific learning problem, such as face recognition or machine translation.

READ FULL TEXT
research
05/27/2010

Compression Rate Method for Empirical Science and Application to Computer Vision

This philosophical paper proposes a modified version of the scientific m...
research
06/30/2022

Deep Learning to See: Towards New Foundations of Computer Vision

The remarkable progress in computer vision over the last few years is, b...
research
07/07/2006

Ten Incredibly Dangerous Software Ideas

This is a rough draft synopsis of a book presently in preparation. This ...
research
05/10/2023

When ChatGPT for Computer Vision Will Come? From 2D to 3D

ChatGPT and its improved variant GPT4 have revolutionized the NLP field ...
research
08/24/2022

22 Examples of Solution Compression via Derandomization

We provide bounds on the compression size of the solutions to 22 problem...
research
06/23/2020

On Compression Principle and Bayesian Optimization for Neural Networks

Finding methods for making generalizable predictions is a fundamental pr...
research
09/01/2020

A Short Review on Data Modelling for Vector Fields

Machine learning methods based on statistical principles have proven hig...

Please sign up or login with your details

Forgot password? Click here to reset