Mean Average Precision (MAP)






MAP (Mean Average Precision): the average precision for a single topic is the mean of the precision values obtained after each relevant document is retrieved. The mean average precision (MAP) over a set of topics is the mean of the per-topic average precisions. MAP is a single-value metric that reflects a system's performance over all relevant documents: the higher the relevant documents are ranked, the higher MAP tends to be. If the system returns no relevant documents, precision defaults to 0.




For example, suppose there are two topics: topic 1 has 4 relevant pages and topic 2 has 5. A system retrieves all 4 relevant pages for topic 1, at ranks 1, 2, 4, and 7; for topic 2 it retrieves 3 of the relevant pages, at ranks 1, 3, and 5. For topic 1, the average precision is (1/1 + 2/2 + 3/4 + 4/7)/4 = 0.83. For topic 2, it is (1/1 + 2/3 + 3/5 + 0 + 0)/5 = 0.45. So MAP = (0.83 + 0.45)/2 = 0.64.
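The arithmetic above can be checked with a short sketch (the function name `average_precision` is my own):

```python
# A topic's AP averages the precision at each relevant hit over the
# total number of relevant documents; misses contribute 0.

def average_precision(hit_ranks, num_relevant):
    """hit_ranks: 1-based ranks of the relevant documents that were
    retrieved; num_relevant: total relevant documents for the topic."""
    precisions = [(i + 1) / rank for i, rank in enumerate(sorted(hit_ranks))]
    return sum(precisions) / num_relevant

ap1 = average_precision([1, 2, 4, 7], 4)   # (1/1 + 2/2 + 3/4 + 4/7) / 4
ap2 = average_precision([1, 3, 5], 5)      # (1/1 + 2/3 + 3/5 + 0 + 0) / 5
map_score = (ap1 + ap2) / 2
print(round(ap1, 2), round(ap2, 2), round(map_score, 2))  # 0.83 0.45 0.64
```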










MRR (Mean Reciprocal Rank) takes the reciprocal of the rank at which the correct answer appears in the system's result list as that query's score, then averages over all queries.
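A minimal sketch of MRR under that definition (names are illustrative):

```python
# MRR: reciprocal of the rank of the first correct answer, averaged
# over all queries; a query with no correct answer contributes 0.

def mean_reciprocal_rank(first_correct_ranks):
    """first_correct_ranks: 1-based rank of the first correct answer
    per query, or None when the system returned no correct answer."""
    return sum(0.0 if r is None else 1.0 / r
               for r in first_correct_ranks) / len(first_correct_ranks)

# Three queries: answers at ranks 1 and 3, and one not found.
print(mean_reciprocal_rank([1, 3, None]))  # (1 + 1/3 + 0) / 3 ≈ 0.444
```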










MAP can be understood through its three parts: P, AP, and MAP.




Start with P (precision). It is widely used in information retrieval, where it usually appears together with recall. For a query that returns a list of documents, precision is the fraction of the returned results that are relevant:

precision = number of relevant documents in the results / number of returned results

Recall is the fraction of all relevant documents that appear in the results:

recall = number of relevant documents in the results / total number of relevant documents

Precision only counts how many relevant documents are returned; it ignores their order. A search engine or recommender system returns a ranked list, and the more relevant a document is, the earlier it should appear, which motivates AP. For a ranked list, AP is computed by taking the precision at each position and averaging over positions; if the document at a position is not relevant, the precision at that position counts as 0.
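The two definitions can be sketched as follows (the document IDs are hypothetical):

```python
# Precision and recall for one query, from the returned result set
# and the full set of relevant documents.

def precision_recall(returned, relevant):
    returned, relevant = set(returned), set(relevant)
    hits = len(returned & relevant)
    return hits / len(returned), hits / len(relevant)

p, r = precision_recall(returned=["d1", "d3", "d5", "d7"],
                        relevant=["d1", "d2", "d3"])
print(p, r)  # precision = 2/4 = 0.5, recall = 2/3 ≈ 0.667
```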





Precision



Main article: Precision and recall


Precision is the fraction of the documents retrieved that are relevant to the user’s information need.















$$\text{precision} = \frac{|\{\text{relevant documents}\} \cap \{\text{retrieved documents}\}|}{|\{\text{retrieved documents}\}|}$$

In binary classification, precision is analogous to positive predictive value. Precision takes all retrieved documents into account. It can also be evaluated at a given cut-off rank, considering only the topmost results returned by the system. This measure is called precision at n or P@n.

Note that the meaning and usage of “precision” in the field of information retrieval differs from the definition of accuracy and precision within other branches of science and statistics.
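Precision at a cut-off can be sketched as follows (the helper name `precision_at_n` and the relevance judgments are my own):

```python
# P@n: fraction of the top n ranked results that are relevant.
# `relevance` is a list of 0/1 judgments in ranked order.

def precision_at_n(relevance, n):
    top = relevance[:n]
    return sum(top) / len(top)

ranked = [1, 0, 1, 1, 0, 0, 1]  # hypothetical relevance judgments
print(precision_at_n(ranked, 3))  # 2 relevant in top 3 -> 0.666...
print(precision_at_n(ranked, 5))  # 3 relevant in top 5 -> 0.6
```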


Recall





Recall is the fraction of the documents that are relevant to the query that are successfully retrieved.















$$\text{recall} = \frac{|\{\text{relevant documents}\} \cap \{\text{retrieved documents}\}|}{|\{\text{relevant documents}\}|}$$

In binary classification, recall is often called sensitivity. So it can be looked at as the probability that a relevant document is retrieved by the query.

It is trivial to achieve recall of 100% by returning all documents in response to any query. Therefore, recall alone is not enough but one needs to measure the number of non-relevant documents also, for example by computing the precision.


Average precision



Precision and recall are single-value metrics based on the whole list of documents returned by the system. For systems that return a ranked sequence of documents, it is desirable to also consider the order in which the returned documents are presented. By computing a precision and recall at every position in the ranked sequence of documents, one can plot a precision-recall curve, plotting precision $p(r)$ as a function of recall $r$. Average precision computes the average value of $p(r)$ over the interval from $r = 0$ to $r = 1$:

$$\text{AveP} = \int_0^1 p(r)\,dr$$
That is the area under the precision-recall curve. This integral is in practice replaced with a finite sum over every position in the ranked sequence of documents:













$$\text{AveP} = \sum_{k=1}^{n} P(k)\,\Delta r(k)$$

where $k$ is the rank in the sequence of retrieved documents, $n$ is the number of retrieved documents, $P(k)$ is the precision at cut-off $k$ in the list, and $\Delta r(k)$ is the change in recall from items $k-1$ to $k$.
This finite sum is equivalent to:








$$\text{AveP} = \frac{\sum_{k=1}^{n} \big(P(k) \times \text{rel}(k)\big)}{\text{number of relevant documents}}$$

where $\text{rel}(k)$ is an indicator equal to 1 if the item at rank $k$ is a relevant document, and 0 otherwise.
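The finite sum maps directly to code; a sketch assuming 0/1 relevance judgments in ranked order (the function name `ave_p` is my own):

```python
# AveP via the finite sum: accumulate P(k) at every rank where
# rel(k) = 1, then divide by the number of relevant documents.

def ave_p(relevance, num_relevant):
    """relevance: 0/1 judgments in ranked order; num_relevant: total
    relevant documents (may exceed the number retrieved)."""
    hits, total = 0, 0.0
    for k, rel in enumerate(relevance, start=1):
        if rel:                # rel(k) = 1
            hits += 1
            total += hits / k  # P(k) at this cut-off
    return total / num_relevant

# Topic 1 from the earlier example: relevant at ranks 1, 2, 4, 7.
print(round(ave_p([1, 1, 0, 1, 0, 0, 1], 4), 2))  # 0.83
```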

























Mean average precision



Mean average precision for a set of queries is the mean of the average precision scores for each query.








$$\text{MAP} = \frac{\sum_{q=1}^{Q} \text{AveP}(q)}{Q}$$

where Q is the number of queries.
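The definition translates directly; a short sketch using the AveP values from the earlier worked example:

```python
# MAP = sum of AveP(q) over all queries, divided by Q.

def mean_average_precision(ap_scores):
    return sum(ap_scores) / len(ap_scores)

print(round(mean_average_precision([0.83, 0.45]), 2))  # 0.64
```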