The neuroscout-paper's discuss from neuroscout

neuroscout-paper's Issues

dataset descriptors for methods section

get number of unique subjects and average scan time per subject for all datasets
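
A minimal sketch of how these descriptors could be computed, assuming run-level records are available as dictionaries. The field names `dataset`, `subject`, and `duration` (in seconds) are hypothetical, not Neuroscout's actual schema, and "average scan time per subject" is read here as each subject's total scan time, averaged within a dataset.

```python
from collections import defaultdict


def summarize_runs(runs):
    """Count unique subjects and average total scan time per subject, by dataset.

    `runs` is an iterable of dicts with hypothetical keys:
    "dataset", "subject", and "duration" (seconds).
    """
    # dataset -> subject -> total scan time across that subject's runs
    totals = defaultdict(lambda: defaultdict(float))
    for run in runs:
        totals[run["dataset"]][run["subject"]] += run["duration"]

    summary = {}
    for dataset, per_subject in totals.items():
        summary[dataset] = {
            "n_subjects": len(per_subject),
            "mean_scan_time": sum(per_subject.values()) / len(per_subject),
        }
    return summary


# Toy records for illustration only; these are not real dataset values.
toy_runs = [
    {"dataset": "A", "subject": "01", "duration": 600.0},
    {"dataset": "A", "subject": "01", "duration": 300.0},
    {"dataset": "A", "subject": "02", "duration": 900.0},
    {"dataset": "B", "subject": "01", "duration": 500.0},
]
toy_summary = summarize_runs(toy_runs)
```

Summing per subject before averaging keeps datasets with uneven numbers of runs per subject from skewing the mean.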

Final read through (es discussion)

Submit meta-analytic maps to neurovault

logo broken on jupyter book

Logo on https://neuroscout.github.io/neuroscout-paper/intro.html is broken

tidy up face and language notebooks

Issues with lexical frequency spoken

Identical DMs show up for different models
hierarchical structure is messed up
missing effect and variances

Running NNDb single (not combined)

Final Final AudioSet + Clarifai models

@tyarkoni will revise list of single predictor models to add "body" and remove a few.

start with single dataset results on Building, then go into meta-analysis?

Incorporate analysis_ids into paper, as table and reference in Results?

Dataset or feature specific issues / irregularities

This is just a record of some "quirks" that are dataset or feature specific. Feel free to edit and add to the list
Will work on making a more general version of this for public consumption

StudyForrest is in German and has no ingested speech transcript
The Sherlock movie is present in both the Sherlock and SherlockMerlin datasets, with different subjects. We're going to focus on the Sherlock dataset for the Sherlock task, and on the SherlockMerlin dataset for the Merlin task.
SchematicNarrative has no "tokenized" BERT features because there are independent stimuli within each run.
See: neuroscout/neuroscout#772
Life dataset has no faces due to the nature of the stimuli

shorten methods

remove feature descriptions for features we do not use (e.g., BERT or AudioSet)

fix missing shot change for NNDB

Display surface plots inline in notebooks if possible (if not, it's OK)

Load images from Neurovault instead of drakh

For maximal reproducibility

re-run lexical frequency notebook plots (not models)

To fix discrepancy w/ task names

Update requirements.txt and add Dockerfile

write up discussion/conclusions

Points maybe worth mentioning

Neuroscout makes multi-dataset reproducible workflows accessible
Makes using novel features easy
Not mutually exclusive w/ experimental research
Caveats in the interpretation of results (e.g., features are model dependent)
What next
- More datasets and features
- Dataset release
- Enable browsing
- Better integration with meta-analysis workflow
- Support other models?

refine plotting utils

create utils for surface plots (at least for single-predictor models), we need more compact ways to visualize results

re-run some audioset models

re-run some audioset meta-analyses w/ new datasets (e.g., speech, music, whistling?), possibly with thresholding

Include report plots and regressor plots

Once the set of models is final, we should re-run all notebooks so as to include at least some sample reports and timeline/distribution plots for regressors of interest.

figure for platform overview / user flow

simpler figure for platform overview (very "functional", displaying Neuroscout components by their role in the workflow)

Remove empty nodes from json collections

Datasets with no model (e.g. studyforrest for entropy models) are still included in the collections as empty nodes (studyforrest: {}). We should drop all nodes for datasets with no associated analysis from all collections.
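
One possible way to do this, sketched under the assumption that each collection is a nested JSON object whose leaves are analysis records. The keys and IDs in the example are made up for illustration; the actual layout of the collection files may differ.

```python
def prune_empty(node):
    """Recursively drop keys whose value is an empty dict.

    Non-dict values are returned unchanged, so falsy leaves such as 0 or ""
    are kept; only empty nodes (including ones that become empty after
    pruning their children) are removed.
    """
    if not isinstance(node, dict):
        return node
    pruned = {key: prune_empty(value) for key, value in node.items()}
    return {key: value for key, value in pruned.items() if value != {}}


# Hypothetical collection: model name -> dataset -> analysis record.
example_collection = {
    "entropy": {
        "studyforrest": {},
        "Sherlock": {"hash_id": "xxxxx"},
    }
}
cleaned_collection = prune_empty(example_collection)
```

The function returns a new object rather than mutating its input, so the original file contents can be compared against the cleaned version before overwriting anything.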

Add meta-analytic maps + upload to NV

We haven't pushed meta-analytic maps to this repo (though they can be reproduced).
Should we do that?

fix schematicnarrative missing runs

running: big sensorimotor norm model

inspect LM surprisal

Get surprisal metrics for different models and across transcript vs. force-aligned (= no punctuation)
Look at correlations between models
Qualitative inspection of examples
For now, only focus on window_size = 25
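
The "correlations between models" step could be sketched as follows, assuming surprisal values from each model are already aligned token by token into equal-length lists. The model names and numbers below are placeholders, not real results, and alignment between transcript and force-aligned text is not handled here.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length sequences of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def pairwise_correlations(surprisal):
    """Map each unordered pair of model names to the correlation of their surprisal."""
    return {
        (m1, m2): pearson(surprisal[m1], surprisal[m2])
        for m1, m2 in combinations(sorted(surprisal), 2)
    }


# Placeholder values for illustration only.
toy_surprisal = {
    "model_a": [1.0, 2.0, 3.0, 4.0],
    "model_b": [2.0, 4.0, 6.0, 8.0],
    "model_c": [4.0, 3.0, 2.0, 1.0],
}
toy_correlations = pairwise_correlations(toy_surprisal)
```

Running this once per input type (transcript vs. force-aligned) and once per window size would give comparable tables; for now only window_size = 25 is needed.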

Final checklist

Export to Overleaf
Add Acknowledgments
Table 1: Add links
Fig 3 label is on dotted line
Add line numbers (perhaps make 2 versions at the last minute)
Consistent use of em dash (—). It should be specifically that character, with no spaces around it.
The FaceNet methods section has several variables not presented (e.g. first_time_face)
Rebuild jupyter book and link

Citation related issues:

fMRIprep section needs references properly cited
Manually check all refs
Markiewicz citations look odd too, e.g.: C. Markiewicz et al., 2021; C. J. Markiewicz et al., 2021 (statsmodels and fitlins, respectively)

run single-predictor mel models

Fit separate models with re-extracted mel features to reconstruct tonotopic maps (not necessarily relevant for the paper, but useful as a preliminary result for the OHBM submission and to kick-start some audio analyses).
Analysis should probably be set up as classification.

(re)run analyses for final version of paper

Single predictor models:

Should be all done

Frequency analyses:

We need to look into which models to report, but probably something incremental
Run on NNDB and Narratives
Check consistency in estimators and if inconsistent rerun

Shot-change:

Re-run on NNDB
Check consistency in estimators and if inconsistent rerun

Lancaster norms:

Rerun on NNDB and Narratives
Check consistency in estimators and if inconsistent rerun

FaceNet:

@adelavega extracts NNDB
Run NNDB
Check consistency in estimators and if inconsistent rerun

AudioSet:

Run music on NNDB and Narratives (need to be extracted)
Maybe explore couple of other features (selectively pick them from the ontology)
Check results and make decision on whether to keep them
If we keep them, check consistency in estimators and if inconsistent rerun

BERT:

Drop it for now (play with it in separate projects, e.g., https://github.com/rbroc/eval-neuroscout-lm)
Drop the embeddings too, but that could also go in a separate project

Reading brain dataset:

Ignore for now, but maybe run frequency on it after everything else is done.

shorten FFA and frequency paragraphs
add Lancaster norms discussion

Detect and correct these NV collections.
