Comments (3)
The behaviour is explained in custom-eval.md:
If you notice evals has cached your data and you need to clear that cache, you can do so with
rm -rf /tmp/filecache
.
This will be a difficult spot for people looking through custom-eval I think. I've also noticed that the flag --no-cache
exists in the list of oaieval options. I'm not sure if this is intended to prevent .jsonl files from being cached (which would be a nice feature), but as far as I can tell the code associated with the arg isn't doing anything:
Lines 173 to 175 in 19bfdba
Though of course @andrew-openai will know a lot more than I do!
from evals.
I'm currently having the same problem. I'm unsure of how to clear the cache to get it to "see" my new jsonl file. I've tried to change the name of the test to no avail
EDIT: After running with --debug, I found it was loading a pkl
file from my /tmp
folder. After deleting it I got it to run all my samples.
from evals.
Thanks, it helps.
The behaviour is explained in custom-eval.md:
If you notice evals has cached your data and you need to clear that cache, you can do so with
rm -rf /tmp/filecache
.This will be a difficult spot for people looking through custom-eval I think. I've also noticed that the flag
--no-cache
exists in the list of oaieval options. I'm not sure if this is intended to prevent .jsonl files from being cached (which would be a nice feature), but as far as I can tell the code associated with the arg isn't doing anything:Lines 173 to 175 in 19bfdba
Though of course @andrew-openai will know a lot more than I do!
from evals.
Related Issues (20)
- Using different models in evaluating mode-graded eval and in generating the completion HOT 5
- `Failed to open: ../registry/data/social_iqa/few_shot.jsonl` with custom registry
- Evals broken with latest openai package v1.1.1 HOT 2
- Do not back off on `openai.BadRequestError` HOT 1
- Proposal for Adding a New Evaluation Metric: Sentiment Analysis Accuracy
- Improvements to `Match`: case insensitive and strip
- Running an evaluation can lead to circular import error HOT 4
- oaieval doesn't run beacuse of "module 'openai' has no attribute 'error'" HOT 3
- Error structure in `utils` after openai package upgrade HOT 2
- Mismatch between LangChainChatModelCompletionFn code and registry HOT 3
- Possibility to sell high quality benchmarks HOT 1
- Request to change arithmetical_puzzles prompting
- Tagged Release For 2.0.0 HOT 1
- Local run doesn't save logs to disk HOT 1
- Support for Azure OpenAI client HOT 2
- Support multiple completions for ModelbasedClassify
- `OpenAIChatCompletionFn` should `__init__` should accept `**kwargs`
- Setting completion function args via CLI does not work
- When installing the project dependencies, i got: "ERROR: Could not build wheels for greenlet, which is required to install pyproject.toml-based projects" HOT 3
- Getting started example doesn't work - oieval attempts to update a None type object HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from evals.