1 Introduction
In recent years many papers have been devoted to the analysis and applications of negations of finite probability distributions (PD) [2-14], first considered by R. Yager in [1].
This paper gives a brief overview of some formal results on the definition and properties of negations of PD. Negations of PD are generated by negators of probability values transforming element-by-element PD into a negation of PD [2]. Negators are non-increasing functions of probability values that can be PD-independent or PD-dependent.
It was shown that Yager's negator [1] is fundamental in the characterization of linear PD-independent negators as a convex combination of Yager's negator and uniform negator [2].
Negators give new tools for the transformation of probability distributions. They are similar to negations in fuzzy logic [15, 16]. The concepts of contracting and involutive negations studied in fuzzy logic [15,16] can be extended on negators of probability values [3].
Involutive negations are important for logic. Such involutive negation can be introduced also for probability distrbutions [3].
The paper proposes a new simple representation of linear negators generalizing Yager's negator, and compare them with other known negators of PD. The paper discusses some open problems in the analysis of negations of probability distributions.
The paper has the following structure. Section 2 gives some basic definitions of negators of probability values in PD and negations of PD. Sections 3 and 4 consider PD-independent and linear negators. Section 5 considers the influence of negations of PD on the entropy of PD. Sections 6 and 7 consider PD-dependent and involutive negators. Section 8 introduces a new, simple representation of linear negators. Section 9 contains examples. Section 10 contains the conclusion and discusses future work.
2 Negations of Probability Distributions
A probability distribution (PD) of length n is a sequence P=(p1,…,pn) of n real values satisfying for all i=1,…,n, (n≥2), the properties:
Let Pn be the set of all probability distributions of the length n. A negation of probability distribution is a function neg:Pn→Pn such that for any PD P=(p1,…,pn) in Pn the probability distribution
satisfies for all i=1,…,n, the following properties:
From (2), it follows:
The property (3) is similar to the property of negation of membership and truth values in fuzzy logic [15, 16], defined as a decreasing function on [0,1]. The essential difference between negation (complement) of fuzzy sets and negation of PD is that the negation Q=neg(P) of PD P should satisfy the following properties of probability distributions:
A function N, transforming elements pi of PD P into elements qi=N(pi) of PD Q=neg(P) is called a negator.
From (5),(6), and (3), it follows that for all i=1,…,n the negator satisfies the following properties:
In terms of negators, a negation of probability distribution P=(p1,…,pn) can be presented as follows:
and we will say that PD P is generated by negator N. The general method of constructing negators was proposed in [2].
Theorem 1 [2]. Let P=(p1,…,pn) be a probability distribution and f(p) be a non-negative non-increasing real-valued function satisfying the property: ∑nj=1f(pj)>0, then the function N defined for all i=1,…,n by:
is a negator.
Generally, a negator depends on the set of values pi in P, but in the following Section, we consider PD-independent negators N(p) defined on [0,1] and depending only on value p in [0,1].
The probability distribution
is called the uniform distribution.
A probability distribution P is called a fixed point of a negation neg of PD if
Theorem 2 [2]. The uniform distribution is a fixed point of any negation neg of probability distributions, that is:
3 PD-Independent Negators
A PD-independent negator N is a non-increasing function N:[0,1]→[0,1] such that for any p,q in [0,1], it is fulfilled:
and for any PD P=(p1,…,pn) in Pn it is fulfilled:
PD-independent negator N(p) depends only on the value of p∈[0,1], and at first glance, it isn't easy to expect the existence of a PD-independent negator such that for any PD P in Pn the property (15) will hold. But formula (11) can also be used for constructing PD-independent negations, as shown in Section 8.
It is easy to see that Yager's negator [1]:
is a PD-independent negator. It is a decreasing function of p on [0,1]. The Yagers's negator defines the following negation of a probability distribution P=(p1,…,pn):
and ∑ni=1NY(pi)=1.
Another PD-independent negator called uniform negator was proposed in [2]:
This negator defines the transformation of any PD P into the uniform distribution:
Theorem 3 [2]. Any PD-independent negator N has the unique fixed point p=1n, and any PD-independent negation of probability distributions negN has a unique fixed point PU.
Hence, for any PD-independent negator 𝑁 we have:
Theorem 4 [2]. For any PD-independent negator N, the following properties are satisfied:
For example, we have for p=0 and p=1:
4 Linear Negators
A PD-independent negator N is called linear if N(p) is a linear function of p∈[0,1]. A negation of PD generated by a linear negator is called a linear negation of PD. One can see that Yager's negator and uniform negators are linear negators. The convex combination of Yager's negator and uniform negator was used in [2] for constructing a class of linear PD-independent negators. This class of negators is considered below.
Theorem 5 [2]. A PD-independent negator N(p) is a linear negator if and only if it is a convex combination of negators NU and NY, i.e. for some α∈[0,1] for all p in [0,1] it holds:
From (21) it follows: nN(1)∈[0,1], and using α=nN(1) from (22), we can obtain:
From (23) and (21), we see that any linear negator can be obtained from Yager's negator by suitable selection of the value of N(1) in the interval [0,1n]. When N(1)=0, formula (23) gives the Yager's negator. When N(1)=1n, we obtain the uniform negator N(p)=N(1)=1n.
Using p=0 in (23), we can obtain:
and represent linear negator as a function of N(0):
In [2], one can find a graphical representation of linear negators.
5 Linear Negations and Entropy of PD
Theorem 6 [3]. For linear negator N for any p in [0,1], it holds
where N1(p)=N(p), Nk(p)=N(Nk−1(p)) for k=2,…
Theorem 7 [3]. If N is a linear negator, then for the corresponding linear negation of PD negN(p)=(N(p1),…,N(pn)) for any PD P in Pn it holds:
where negkN(P)negN(negk−1N(P)) for k=2,… and neg1N(P)=negN(P).
As it follows from Theorem 7, the multiple linear negations of probability distributions have as the limit the uniform distribution PU=(1n,…,1n) with the maximal entropy [1] value:
The similar result was obtained for PD-dependent negators NZ [14] and NW [10] considered below.
6 PD-Dependent Negators
PD-independent negator N:[0,1]→[0,1] is a function of probability value p from [0,1], and N(p) depends only on the value p, but not depends on PD where this value appears. Hence, if N is PD-independent negator, then for any PD P=(p1,…,pn) and Q=(q1,…,qn) in Pn, it holds:
A negator N that is not PD-independent is referred to as PD-dependent. Generally, for PD-dependent negator, the values N(pi) for probability values pi in a PD P=(p1,…,pn) depend on other values pj in P, and (25) does not fulfill. Below are examples of PD-dependent negators of PD. Note that both of them have the form (11).
Negator of Zhang et al. based on Tsallis entropy [14]:
Exponential negation of Wu et al. [10]:
7 Involutive Negations and Negators
Considered above negators N define negations neg(P) transforming probability distributions P in Pn into PD neg(P) in Pn. If PD P simulates the term HIGH PRICE then its negation neg(P) can simulate the term NOT(HIGH PRICE). It is reasonable like in logic to have involutive negations satisfying to: NOT(NOT(HIGH PRICE))=HIGH PRICE.
A negation neg of probability distributions is involutive if for any PD P in Pn it is fulfilled:
A PD-independent negator N is involutive if for any p in [0,1] it is fulfilled:
Theorem 8 [3]. Any PD-independent negator N is non-involutive.
As follows from the Theorem 8, involutive negator should be PD-dependent. It is easy to show that PD-dependent negators considered in the previous Section are non-involutive.
Such an involutive negator was introduced by Batyrshin [3]. Let P=(p1,…pn) be a probability distribution. Denote max(P)=maxi{pi}=max{p1,…,pn}, min(P)=mini{pi}=min{p1,…,pn} and MP=max(P)+min(P).
Theorem 9 [3]. Let P=(p1,…pn) be a probability distribution. Then the function:
i=1,…,n, is an involutive negator.
We have NB(1n)=1n, i.e. 1n is the fixed point of Batyrshin's negator NB. Note that negator (28) is PD-dependent. When one calculates Q=negN(P)=(NB(p1),…,NB(p1)) the negator NB(pi) uses in (28) maximal and minimal values of P. But when one will calculate negN(negN(P))=negN(Q)=(NB(q1),…,NB(q1)) in the formula (28) instead of max(P)+min(P) one should calculate max(Q)+min(Q).
The simplified version of involutive negation (28) for PD with min(P)=0 was independently proposed by Pham et al. [8].
8 A New Representation of Linear Negators
Here we propose a new representation of linear negators:
This negator can be considered as a straightforward generalization of Yager's negator that will be obtained from (29) when d=1. Below we show how it can be constructed from the general method of construction of negators (11).
Using (11), we can represent linear negator as a linear function as follows:
It is clear that a≠0, otherwise N(pi)=pi and N will be increasing, which contradicts (9). Hence we can represent N(pi) as follows:
Since a negator is a non-increasing function, parameter c is non-positive: c≤0. Replacing it by a non-negative parameter d≥0, we obtain:
From (18) and (19) it follows NL(p)∈[0,1n−1] that gives: d≤1. Replacing (30) by the function defined on [0,1], we obtain a new representation for linear PD-independent negators:
Changing in (31) parameter d from 0 to 1, we obtain different linear negators. For d=0, we have uniform negator: NL(p)=NU(p)=1n, and when d=1, we obtain Yager's negator: NL(p)=NY(p)=1−pn−1. We obtain (24) and (23) from (31) after substitutions:
(32) together with (20) and (21) again gives d∈[0,1].
For p=1n we have in (31):
i.e., p=1n is a fixed point of linear negators.
In the following Section we consider examples of negations of PD using different negators.
9 Examples
Consider probability distributions from P5, (n=5): P=(0,0.1,0.2,0.3,0.4) and Q=(0.1,0.1,0.1,0.3,0.4), with uniform distribution PU=(0.2,0.2,0.2,0.2,0.2).
Compare the results of negation of probability distributions P and Q using different negators.
For uniform negator NU (17) we have:
negU(P)=negU(Q)=(0.2,0.2,0.2,0.2,0.2)=PU.
For Batyrshin's involutive negator NB (28) we have:
negB(P)=(0.4,0.3,0.2,0.1,0),negB(negB(P))=(0,0.1,0.2,0.3,0.4)=P.negB(Q)=(0.267,0.267,0.267,0.133,0.067),negB(negB(Q))=(0.1,0.1,0.1,0.3,0.4)=Q.
For Yager's linear negator NY (16) we have:
negY(P)=(0.250,0.225,0.2,0.175,0.150),negY(negY(P))=(0.188,0.194,0.2,0.206,0.213).
For linear negator NL (31) with d=0.5 we have:
negL(P)=(0.222,0.211,0.2,0.189,0.178),negL(negL(P))=(0.198,0.199,0.2,0.201,0.202),negL(Q)=(0.211,0.211,0.211,0.189,0.178).
For negator NZ of Zhang et al. (26) with k=2 we have:
negZ(P)=(0.213,0.211,0.204,0.194,0.179),negZ(negZ(P))=(0.199,0.199,0.200,0.201,0.202),negZ(Q)=(0.210,0.210,0.210,0.193,0.178).
For negator NW of Wu et al. (27), we have:
negW(P)=(0.242,0.209,0.198,0.179,0.162),negW(negW(P))=(0.192,0.196,0.200,0.204,0.208),negW(Q)=(0.219,0.219,0.219,0.180,0.162).
These examples illustrate the main properties of negators and corresponding negations of PD:
We see that the values of P and Q are increasing and the values of neg(P) and neg(Q) are decreasing because the negators are decreasing functions.
The uniform negator transforms any PD into the uniform PD PU.
For the Batyrshin's involutive negator NB we have: negB(negB(P))=P, but for other negators the involutivity property does not fulfil: neg(neg(P))≠P.
The involutive negator NB and linear negators NY and NL keep the fixed point: N(1n)=1n. In our examples we have: n=5, 1n=0.2, and N(0.2)=0.2. For PD-dependent negators NZ and NW this property generally does not fulfil.
The values N(pi) of PD-independent linear negators depend only on negated probability value pi but for PD-dependent negators they depend also on other probability values pj in PD. Compare negations of values 0.1, 0.3 and 0.4 in PD P and Q by linear negator NL and PD-dependent negators NZ and NW.
All considered non-involutive negators transform PD P into neg(P) and neg(neg(P)) approaching uniform distribution, in our case PU=(0.2,0.2,0.2,0.2,0.2). This property proved for linear negators, see Theorems 6 and 7, and for NZ and NW, but generally not necessary fulfilled for PD-dependent negators.
10 Conclusion and Future Work
The paper observed the main definitions and formal properties of negators and corresponding negations of probability distributions. The examples illustrate these properties. We did not consider here contracting negations of PD [3] also studied in fuzzy logic [15, 16].
In [3], it was proved that any linear negator except for uniform negator NU is strictly contracting. As a result, multiple linear negations increase the entropy of probability distributions and converge to the uniform distribution PU. We plan to study negations of PD as transformations of PD-dependent negations also in terms of similarity and distance between PD [17,18].
Another direction of possible research is related to the linguistic interpretation of PD proposed by Yager [1]. We can use similar approaches to linguistic descriptions of other types of data [19].
Also, we plan to consider possible applications of negations of PD, in Dempster-Shafer theory, like in [1] and some other papers from the list of references.
The open problems are: