Skip to content

Evaluation code#8

Merged
Jackal1586 merged 14 commits intomainfrom
evaluation_code
Sep 17, 2024
Merged

Evaluation code#8
Jackal1586 merged 14 commits intomainfrom
evaluation_code

Conversation

@sbmaruf
Copy link
Copy Markdown
Contributor

@sbmaruf sbmaruf commented Nov 7, 2023

Sample scripts of the evaluation.

Since running gen_program_synthesis.py can be costly, I have uploaded the openai projected samples in this temporary dropbox. If you are reviewing this code or just browsing, please do not upload this
data anywhere for the integrity (cross site data contamination for LLM) of the benchmark. You may use this projected data just for testing in your local machine.

@Jackal1586 Jackal1586 merged commit 4993b99 into main Sep 17, 2024
@Jackal1586 Jackal1586 mentioned this pull request Sep 17, 2024
@yazdanbakhsh
Copy link
Copy Markdown

Sounds good.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants