denispeskoff / fednlp Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 4.0 20.99 MB

Jupyter Notebook 99.80% Python 0.20%

fednlp's Issues

Minutes: Entire Minutes

For every available meeting minutes, ingest the entire minutes and predict -1, -.5, 0, .5, 1

Task 5: Plot Sentence-Level Results

@Benjamin Wachspress could you please plot the sentence-level results for us?

In the results folder, it is this file.

line plot
date on the x-axis
hawk/dove score on the y-axis
make sure y-axis label indicates which is hawkish and which is dovish.

Thanks!

Run all minutes data

In addition to the two transcript requests,if <$200

run all the MINUTES data the same way as the statements
https://github.com/trigaten/FedNLP/blob/main/data/minutes.csv

INPUT ENTIRE minutes -> OUTPUT single -1 to 1 prediction.

Task 3: Record Descriptive Stats from Sentence Level Processing

@sanderschulhoff in addition to the unweighted average, it would be helpful to also have access to the raw counts for each sentence level classification. For the task you have already completed as well as for the upcoming sentence-level work on transcripts.

For example output: {19940204: {-1: 10, -0.5: 2, 0:25, 0.5:4, 1:10, avg:0.4},
19940322: {-1: 4, -0.5: 0, 0:18, 0.5:3, 1:6, avg:0.3333333333333333}, etc}

Prioritization

1.X
2.Y
3.Z

Task 7: finalizing figure

We need a beautiful figure to put either in the top right of the second column on the first page that summarizes our task.

Or one on the top of the second page (spanning the entire width).

(Or likely both, with one explaining the task, and one presenting the results)

Task 9: Prepare transcript data by speaker for GPT processing

We want to create data that lumps everything said by a speaker within a transcript together:

John: sentence 1, sentence 2, sentence 6, sentence 7
Sally: sentence 3, sentence 4
Joe: sentence 5

We will want to be able to 1) evaluate the hawk/dove stance for a speaker 2) the aggregate stance of all speakers in the transcript 3) how the same speaker changes throughout different years of transcripts

Transcripts: By Speaker

For each available transcript, calculate BY SPEAKER a prediction of -1, -.5, 0, .5, 1. Keep this separated across unrelated transcripts.

Example:
Joe: sentence 1. sentence 2. sentence 3
Adam: sentence 1
Joe sentence 1. sentence 2

Should be predicted as:
Joe: .5
Adam: 0
Joe: 1

Task 8: Manual annotation of ALL statements

We need to go through all the statements we are sending to GPT and rank them -1, -.5, 0, .5, 1 the same way Alan did in the email to create a gold label.

Following this guidance

Please optimizing accuracy first then speed second, classify all statements.
If you are unsure then mark is as ? and we can escalate to Alan for grey area decisions.

Also keep an eye peeled for anything especially notable/worth commenting on. (the 1 out of 100 statement)

Alan's thoughts were:
Nov. 1994 is a definite +1, not so much for the wording (though that fits your scale) as for the huge 75 bp move.
Dec. 2008 is way beyond a -1. The Fed threw the kitchen sink at the failing economy then—big rate cut, forward guidance,…

Task 2: hawk/dove for each statement, not by sentence

@sanderschulhoff
can you do for the ENTIRE statement? That should be easy if ingestible. Then we can compare that how that looks relative to a sentence average.

Task 6: Speaker Level Transcript

@sanderschulhoff Next step is to process the transcripts. The main difference from the statements is we need to produce a hawk/dove score for each speaker, for each meeting. So the first new task is, for each speaker, create a list of all of their sentences. We will then iterate and score those speaker sentences.

For each transcript from 1994-2016:
Get all sentences for each speaker:
For each sentence by speaker:
Measure each sentence on scale of -1, -0.5, 0, 0.5, 1 (according to Category definitions from above)
Record average measurement for each speaker
Return average measurement for each transcript

For example output: {19940204: {greenspan:0.46, yellen:-0.25, geitner:0.65, ... , avg:0.26},
{19940204: {greenspan:0.1, yellen:-0.38, geitner:0.45, ... , avg:-0.15}, etc}

Run 4 sentences (used for our figure) through GPT

Run each one as a separate sentence prediction:

“The gradual increase in core inflation over the past year is a concern to me.”
“I do not believe that there is a very great risk of an unmanageable outbreak of inflation during the relevant policy horizon.”
“I think we should not slow our pace of easing moves at this meeting.”

(this one is a statement but don't think that changes anything)
4.“The Committee continues to believe that against the background of its long-run goals of price stability and sustainable economic growth and of the information currently available, the risks are weighted mainly toward conditions that may generate economic weakness in the foreseeable future.”

Statements: Entire Statement

For all available statements, predict the entire statement (all sentences together) as -1, -.5, 0, .5, 1.

Task 1: recreate sentence level classification

For starters, here is the paper I referenced earlier today where researchers at the Fed tested GPT against other traditional NLP methods. Let's start by recreating this using the statements in our dataset from 1994-2016. Specifically, we should prompt GPT-4 to characterize each sentence exactly as they did in their paper, from -1 to 1 (photo attached). Once we have all of the sentences marked, we can take an unweighted average of the scores for all sentences within a meeting to give us a "score" for that date. Let me know if you have questions, but I think starting here makes sense as it is more straightforward than working with the minutes/transcripts.

Tarullo_-1_dove.txt
Mester_.5_Hawk.txt
George_1_hawk.txt

Run all transcript data

In addition to the by-speaker version of the transcript (#7), If the cost is <$200,

run all the transcript data the same way as the statements
https://github.com/trigaten/FedNLP/blob/main/data/transcripts.csv

INPUT ENTIRE transcript -> OUTPUT single -1 to 1 prediction.

denispeskoff / fednlp Goto Github PK

fednlp's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs