随机变量和概率分布之间的差异|比较类似术语之间的差异

Random Variables vs Probability Distribution

统计实验是随机实验，可以通过一组已知的结果无限期地重复进行。随机变量和概率分布都与此类实验相关联。对于每个随机变量，存在一个由称为累积分布函数的函数定义的关联概率分布。

什么是随机变量？

A random variable is a function that assigns numerical values to the outcomes of a statistical experiment. In other words, it is a function defined from the sample space of a statistical experiment into the set of real numbers.

例如，考虑两次翻转硬币的随机实验。可能的结果是HH，HT，TH和TT（H - 头，T - 故事）。让变量x为实验中观察到的头数。然后，x可以采用值0、1或2的值，并且是一个随机变量。在这里，随机变量x将映射S = {HH，HT，TH，TT}（示例空间）的集合为{0，1，2}以HH映射到2，HT和TH的方式{0，1，2}被映射到1，tt映射到0。在功能符号中，可以将其写入，x：s→r，其中x（hh）= 2，x（ht）= 1，x（th）= 1和x（tt）= 0。

随机变量有两种类型：离散且连续的变量，因此，随机变量可以假定的可能值数量最多是可计数的。在上一个示例中，随机变量x是一个离散的随机变量，因为{0，1，2}是有限集。现在，考虑在课堂上找到学生体重的统计实验。令y为定义为学生的权重的随机变量。y可以在特定间隔内采用任何实际值。因此，y是一个连续的随机变量。

什么是概率分布？

概率分布是一个描述随机变量采用某些值的概率的函数。

一个称为累积分布函数（F）的函数可以从实际数字集定义为实数的集合，为f（x）= p（x≤x）（x的概率小于或等于x）每个可能的结果x。现在，第一个示例中X的累积分布函数可以写为f（a）= 0，如果a <0;f（a）= 0.25，如果0≤a<1;f（a）= 0.75，如果1≤a<2，而f（a）= 1，则如果a≥2。

In case of discrete random variables, a function can be defined from the set of possible outcomes to the set of real numbers in such a way that ƒ(x) = P(X = x) (the probability of X being equal to x) for each possible outcome x. This particular function ƒ is called the probability mass function of the random variable X. Now the probability mass function of X in the first particular example can be written as ƒ(0)=0.25, ƒ(1)=0.5, ƒ(2)=0.25, and ƒ(x)=0 otherwise. Thus, probability mass function along with the cumulative distribution function will describe the probability distribution of X in the first example.

在连续的随机变量的情况下，对于每个x而言，可以将称为概率密度函数（ƒ）的函数定义为ƒ（x）= df（x）/dx，其中f是连续随机变量的累积分布函数。很容易看到此函数满足∫ƒ（x）dx = 1。概率密度函数以及累积分布函数描述了连续随机变量的概率分布。例如，使用概率密度函数ƒ（x）= 1/√（2πσσ一）来描述正态分布（这是连续概率分布）²）e^（[（x-µ）]²/(2σ²）。

随机变量和概率分布之间有什么区别？

• Random variable is a function that associates values of a sample space to a real number.

•概率分布是一个将随机变量可以与发生概率相关的值关联的函数。