Document Type
Report
Abstract
The Unicode standard provides several algorithms, techniques, and strategies for assigning, transmitting, and compressing Unicode characters. These techniques allow Unicode data to be represented in a concise format in several contexts. In this paper we examine several techniques and strategies for compressing Unicode data using the programs gzip and bzip. Unicode compression algorithms known as SCSU and BOCU are also examined. As far as size is concerned, algorithms designed specifically for Unicode may not be necessary.
Publication Date
7-31-2002
Recommended Citation
Atkin, S.E., Stansifer, R. (2002). Unicode compress: does size really matter? (CS-2002-10). Melbourne, FL. Florida Institute of Technology.