Skip to content
View john-b-yang's full-sized avatar
🐶
wuphf.com
🐶
wuphf.com

Highlights

  • Pro

Organizations

@saasbook @SoftwareDefinedBuildings @61c-teach @SWE-bench

Block or report john-b-yang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
john-b-yang/README.md

Hey there 👋

I'm John! Currently a 1st year CS PhD student at Stanford University.

Check out john-b-yang.github.io for more.

Pinned Loading

  1. SWE-bench/SWE-smith SWE-bench/SWE-smith Public

    Scaling Data for SWE-agents

    Python 216 14

  2. SWE-agent/SWE-agent SWE-agent/SWE-agent Public

    SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

    Python 16k 1.7k

  3. SWE-bench/SWE-bench SWE-bench/SWE-bench Public

    SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

    Python 3k 517

  4. SWE-bench/experiments SWE-bench/experiments Public

    Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.

    Shell 176 189

  5. princeton-nlp/WebShop princeton-nlp/WebShop Public

    [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

    Python 351 73

  6. princeton-nlp/intercode princeton-nlp/intercode Public

    [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

    Python 219 47