GithubHelp home page GithubHelp logo

swiftyos / auto-gpt-benchmarks Goto Github PK

View Code? Open in Web Editor NEW

This project forked from significant-gravitas/auto-gpt-benchmarks

1.0 1.0 0.0 34.94 MB

A repo built for the purpose of benchmarking the performance of agents, regardless of how they are set up and how they work.

License: MIT License

Shell 0.07% Python 46.36% HTML 0.13% Jupyter Notebook 53.43%

auto-gpt-benchmarks's Introduction

About Me

I'm an AI Engineer with a background in Aerospace Engineering, specializing in the development of intelligent agents. My expertise in Python and Rust allows me to build AI systems that are not only powerful but also efficient and scalable.

Projects

AutoGPT

  • GitHub Repo: AutoGPT
  • Contributed significantly to the AutoGPT project. AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Published Papers

  • GAIA: a benchmark for General AI Assistants
    Authors: Grégoire Mialon, Clémentine Fourrier, Craig Swift, Thomas Wolf, Yann LeCun, Thomas Scialom
    arXiv Link: 2311.12983
    Introduced GAIA, a benchmark for evaluating general AI assistants, focusing on complex challenges like reasoning and multi-modality, presented at ICLR.

  • Testing Language Model Agents Safely in the Wild
    Authors: Silen Naihin, David Atkinson, Marc Green, Merwane Hamadi, Craig Swift, Douglas Schonholtz, Adam Tauman Kalai, David Bau
    arXiv Link: 2311.10538
    Proposed a framework for conducting safe tests of autonomous agents in real-world settings, highlighting the integration of safety and reliability, presented at NeurIPS.

Quick Stats

Swiftos' GitHub stats

auto-gpt-benchmarks's People

Contributors

ambujpawar avatar auto-gpt-bot avatar chitalian avatar dschonholtz avatar erik-megarad avatar fluder-paradyne avatar jakubno avatar lc0rp avatar mrbrain295 avatar nerfzael avatar pwuts avatar rihp avatar scarletpan avatar silennaihin avatar swiftyos avatar torantulino avatar waynehamadi avatar westonwillingham avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.