Problem about calculating the MAP score in CodeXGLUE/Code-Code/Clone-detection-POJ-104/ about codexglue HOT 5 CLOSED

microsoft commented on May 12, 2024

Problem about calculating the MAP score in CodeXGLUE/Code-Code/Clone-detection-POJ-104/

from codexglue.

Comments (5)

guody5 commented on May 12, 2024

Hi @deplay, thanks for pointing out the problem.
I have modified the metric calculation as follows:

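import logging
import sys

import numpy as np


def calculate_scores(answers, predictions):
    # MAP@R: mean over all queries of the average precision at R,
    # where R = len(answers[key]) is the number of true clones of the query.
    scores = []
    for key in answers:
        if key not in predictions:
            logging.error("Missing prediction for index {}.".format(key))
            sys.exit()

        answer = set(answers[key])

        # Precision at each rank k+1 where the prediction is a true clone.
        Avep = []
        for k, p in enumerate(predictions[key]):
            if p in answer:
                Avep.append((len(Avep) + 1) / (k + 1))

        # Misses contribute 0, so divide by R rather than by the number of hits.
        scores.append(sum(Avep) / len(answer))

    result = {}
    result['MAP'] = round(np.mean(scores), 4)
    return result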

Can you please check whether it's correct?

guody5 commented on May 12, 2024

In fact, we follow Section C.4 (Evaluation Metrics) of this paper. According to the paper's description, MAP@R is defined as the mean of the average precision scores, each of which is evaluated for retrieving the R most similar samples given a query. In our case, the set of queries is the set of all test programs. For a program, R is the number of other programs in the same class, i.e. R=499. Therefore, the calculation is the same as in the paper and all results are comparable. To distinguish the two, we follow the paper and call the metric MAP@R instead of MAP.
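For readers who want the definition in symbols, the per-query score described above can be sketched as follows. This restates the MAP@R definition as given by Musgrave et al. (2020), to the best of my understanding; the notation P(i) is used here for illustration, and the reported benchmark number would then be the mean of this quantity over all queries.

```latex
\mathrm{MAP@R} = \frac{1}{R}\sum_{i=1}^{R} P(i),
\qquad
P(i) =
\begin{cases}
\text{precision at } i, & \text{if the } i\text{-th retrieval is correct},\\
0, & \text{otherwise.}
\end{cases}
```

With R=499, each incorrect retrieval among the top 499 contributes 0 to the sum, so the ranking order of the correct retrievals affects the score.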

deplay commented on May 12, 2024

MISIM (https://arxiv.org/pdf/2006.05265.pdf) refers to another paper: Musgrave et al., "A Metric Learning Reality Check", 2020 (https://arxiv.org/pdf/2003.08505.pdf).

In Musgrave's paper, they provide the following content:

In CodeXGLUE/Code-Code/Clone-detection-POJ-104/evaluator/evaluator.py, the core line "scores.append(len(set(a&p))/len(a))" seems to calculate R-Precision rather than MAP@R.
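To make the difference concrete, here is a minimal sketch contrasting the two metrics on a toy query. The function names `r_precision` and `map_at_r` and the sample data are hypothetical, chosen only for illustration; this is not the repository's evaluator, and it assumes each ranked list holds at least the top R candidates.

```python
def r_precision(relevant, ranked):
    """Fraction of the top-R retrievals that are relevant, ignoring their order."""
    r = len(relevant)
    return len(set(ranked[:r]) & relevant) / r


def map_at_r(relevant, ranked):
    """Sum of precision@i over the correct positions in the top R, divided by R."""
    r = len(relevant)
    hits = 0
    total = 0.0
    for i, item in enumerate(ranked[:r], start=1):
        if item in relevant:
            hits += 1
            total += hits / i  # precision at rank i, counted only on a hit
    return total / r


relevant = {"a", "b", "c"}    # hypothetical clones of the query, so R = 3
good_order = ["a", "b", "x"]  # both hits ranked ahead of the miss
bad_order = ["x", "a", "b"]   # same hits, ranked after the miss

print(r_precision(relevant, good_order), r_precision(relevant, bad_order))
print(map_at_r(relevant, good_order), map_at_r(relevant, bad_order))
```

Both orderings retrieve the same two clones, so R-Precision is 2/3 in each case. MAP@R is 2/3 for the first ordering but 7/18 for the second, because it also penalizes correct retrievals that are ranked lower.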

guody5 commented on May 12, 2024

You are right. Thanks. We have corrected evaluator.py and updated the results.

deplay commented on May 12, 2024

The implementation seems OK, and you may want to update here.
