Reduced Perplexity: Uncertainty measures without entropy
Author
Abstract
C. Shannon revolutionized the design of information systems by showing that the logarithms of independent probabilities are additive and constitute a unit of measure for information. Because ln p is additive, basic analytics can be applied to information, such as the average, which forms the entropy, and the average difference, which forms the divergence. Unnecessarily neglected in this formative framework for information theory is the equally important fact that translating entropy back into probability space gives the geometric mean of the distribution, $\prod_i p_i^{p_i}$. This is also the inverse of the perplexity, but...
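The relationship stated in the abstract can be checked numerically. The following is a minimal sketch, not the paper's own code: the function names and the example distribution are assumptions chosen for illustration, and entropy is taken in nats (natural logarithm) to match the ln p used above.

```python
import math

def entropy(p):
    """Shannon entropy in nats: H = -sum_i p_i ln p_i (zero-probability terms contribute 0)."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def geometric_mean_probability(p):
    """Weighted geometric mean prod_i p_i^{p_i}, computed directly in probability space."""
    return math.prod(pi ** pi for pi in p if pi > 0)

def perplexity(p):
    """Perplexity as the exponential of the entropy."""
    return math.exp(entropy(p))

# Hypothetical example distribution (not taken from the paper).
p = [0.5, 0.25, 0.25]

H = entropy(p)
g = geometric_mean_probability(p)

# Translating entropy back to probability space, exp(-H), matches the geometric mean,
# and the geometric mean is the reciprocal of the perplexity.
print(H, math.exp(-H), g, 1.0 / perplexity(p))
```

For a uniform distribution over n outcomes, the same functions give a geometric mean of 1/n and a perplexity of n, which is one way to read the geometric mean as an "average" probability.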
Similar resources
Entropy-based Pruning of Backoff Language Models
A criterion for pruning parameters from N-gram backoff language models is developed, based on the relative entropy between the original and the pruned model. It is shown that the relative entropy resulting from pruning a single N-gram can be computed exactly and efficiently for backoff models. The relative entropy measure can be expressed as a relative change in training set perplexity. This le...
An Entropic Estimator for Structure Discovery
We introduce a novel framework for simultaneous structure and parameter learning in hidden-variable conditional probability models, based on an entropic prior and a solution for its maximum a posteriori (MAP) estimator. The MAP estimate minimizes uncertainty in all respects: cross-entropy between model and data; entropy of the model; entropy of the data’s descriptive statistics. Iterative estim...
Character-based Language Model
Language modelling and also other natural language processing tasks are usually based on words. I present here a more general yet simpler approach to language modelling using much smaller units of text data: character-based language model (CBLM). In this paper I describe the underlying data structure of the model, evaluate the model using standard measures (entropy, perplexity). As a proof-of-...
Uncertainty measures for sensor management in a survivability application
When flying a mission, a fighter pilot is exposed to the risk of being hit by enemy fire. A tactical support system can aid the pilot by calculating the survivability of a given route, which is the probability that the fighter pilot can fly the route without being hit. The survivability estimate will be uncertain due to uncertainty in the information about threats in the area. In this paper, we...
Stem-based maximum entropy language models for inflectional languages
In this work we build language models using three different training methods: n-gram, class-based and maximum entropy models. The main issue is the use of stem information to cope with the very large number of distinct words of an inflectional language, like Greek. We compare the three models with both perplexity and word error rate. We also examine thoroughly the perplexity differences of the ...
Journal title:
- CoRR
Volume abs/1603.08830, Issue -
Pages -
Publication date 2014