IMPLEMENTASI ALGORITMA BYTE PAIR ENCODING UNTUK KOMPRESI FILE

نویسندگان

چکیده

ABSTRACT Data compression is a field that focuses on forming small output file from large file. The need for has given birth to several methods can be implemented in communication activities the network as well storage activities. byte pair encoding algorithm takes advantage of emergence repeated or frequent character pairs. repeating pairs are data and substituted with wildcard where not series One well-known method. implementation system uses Visual Basic 2010 programming language method used this research an applied 
 Keywords: Compression, Byte Pair Encoding, Document.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Byte pair encoding : a text compression scheme that accelerates pattern matching

Byte pair encoding (BPE) is a simple universal text compression scheme. Decompression is very fast and requires small work space. Moreover, it is easy to decompress an arbitrary part of the original text. However, it has not been so popular since the compression is rather slow and the compression ratio is not as good as other methods such as Lempel-Ziv type compression. In this paper, we bring ...

متن کامل

Learning variable length units for SMT between related languages via Byte Pair Encoding

We explore the use of segments learnt using Byte Pair Encoding (referred to as BPE units) as basic units for statistical machine translation between related languages and compare it with orthographic syllables, which are currently the best performing basic units for this translation task. BPE identifies the most frequent character sequences as basic units, while orthographic syllables are lingu...

متن کامل

Klasifikasi Data Cardiotocography Dengan Integrasi Metode Neural Network Dan Particle Swarm Optimization

Backpropagation (BP) adalah sebuah metode yang digunakan dalam training Neural Network (NN) untuk menentukan parameter bobot yang sesuai. Proses penentuan parameter bobot dengan menggunakan metode backpropagation sangat dipengaruhi oleh pemilihan nilai learning rate (LR)-nya. Penggunaan nilai learning rate yang kurang optimal berdampak pada waktu komputasi yang lama atau akurasi klasifikasi yan...

متن کامل

Desain dan Implementasi Face Recognition dan Live Streaming pada Sistem Digital Assistant untuk Staf Medik Fungsional menggunakan Google Glass

Abstrak— Dalam era globalisasi saat ini, rumah sakit dituntut untuk meningkatkan kinerja dan daya saing sebagai badan usaha dengan tidak mengurangi misi sosial yang dibawanya. Hal ini berarti bahwa rumah sakit harus menerapkan kebijakankebijakan strategis agar mampu secara cepat dan tepat dalam pengambilan keputusan sehingga dapat menjadi organisasi yang responsif, inovatif, efektif, dan efisie...

متن کامل

An energy-efficient data cache with byte-repeat pattern encoding

The on-chip cache is a significant source of the energy consumption of today’s processors. Several data compression techniques including Frequent Value Caches are proposed to reduce the energy consumption in the data cache memories. However, the preceding approach has some problems, such as the monitoring time to find the frequent values dedicated for each program and the additional registers t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Media Infotama

سال: 2022

ISSN: ['2723-4673', '1858-2680']

DOI: https://doi.org/10.37676/jmi.v18i2.2716