Rate this book

Managing Gigabytes: Compressing and Indexing Documents and Images

Name: Managing Gigabytes: Compressing and Indexing Documents and Images
Rating: 4.19 (4 reviews)
ISBN: 9781558605701

Ian H. Witten, Alistair Moffat, Timothy C. Bell

Rate this book

In this fully updated second edition of the highly acclaimed Managing Gigabytes , authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading--an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web.

GenresComputer ScienceSoftwareComputersProgramming

560 pages, Hardcover

First published June 2, 1994

About the author

Ian H. Witten

17 books11 followers

What do you think?

Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars

26 (41%)

4 stars

24 (38%)

3 stars

12 (19%)

2 stars

1 (1%)

1 star

0 (0%)

Displaying 1 - 4 of 4 reviews

Chris

142 reviews42 followers

August 31, 2019

The title will make you laugh in 2020, so why would I recommend this citation from stanford.edu/~backrub more than not only the gushing river of nonsense that is the last 5 years of arXiv.ml but even more than the comparatively solid books from 2010 and 2000 on statistics and ML?

Because

[[draft review. goodreads does not have a save function.]]

Much of what machine-learning is about *isn’t* on-the-fly computation, it’s about storing good representations which then index

In an online world where HN has way too much mindshare, it’s relaxing to step back to the days of payphones, cassette voice mails, and yellow page directories.

computer machine-learning

Dave Peticolas

1,377 reviews46 followers

October 8, 2014

This is probably a bit dated due to advances in the state of the art, but it is still a great introduction to the topic of document storage and search.

Natalia Padovani

3 reviews

June 21, 2023

Compressing and Indexing Documents and Images by Ian H. Witten is an essential resource for anyone dealing with large data volumes. This comprehensive guide covers the principles, techniques, and methodologies of data compression and indexing. The book strikes a perfect balance between theory and practical applications, with clear explanations and helpful examples. Its attention to detail and coverage of both textual documents and image data make it a valuable asset for professionals in information management. Highly recommended for those seeking efficient data handling strategies.

This book was instrumental in my research and writing process for an article I recently produced on global gigabyte pricing. It provided me with the necessary insights and knowledge to analyze and understand the complexities of data compression and indexing, enabling me to draw meaningful conclusions. If you're interested, you can find the article at https://hellosafe.pt/telecomunicacoes...

Ernestas Poskus

198 reviews8 followers

November 15, 2021

Refresher on Huffman codes, bitmaps, indexing, compression of images, textual images. A book is a bit old author is still concerned about gigabytes, nevertheless many practices are still applicable today.

Displaying 1 - 4 of 4 reviews

Join the discussion

Managing Gigabytes: Compressing and Indexing Documents and Images

Ian H. Witten, Alistair Moffat, Timothy C. Bell

About the author

Ian H. Witten

Ratings & Reviews

Friends & Following

Community Reviews

Join the discussion