Skip to content

Recreate results found in table 1 #4

Description

@theblackcat102

Hi, I wanted to check if running launchw.sh is the command which recreates the number for table 1?
Cause I'm trying to rerun REMEMBERER for gpt-3.5-instruct-0913 due davinci-003 was no longer accessible from openai platform.

But the results I got is quite low with only 0.07 success rate

[2024-05-02 12:45:31,856 INFO webshop/186-MainProcess] END! TaskIdx: 99, TaskId: 99, #Steps: 4(0), Reward: 0.50, Succeds: False
[2024-05-02 12:45:31,856 INFO webshop/189-MainProcess] ──────────8.44──────────0.254──────────0.070──────────
[2024-05-02 12:45:31,857 INFO webshop/497-MainProcess] ━━━━━━━━━━━━━━━━━━━Epoch 0━━━━━━━━━━━━━━━━━━━━
[2024-05-02 12:45:31,857 INFO webshop/498-MainProcess] Size: 4, Avg AD Size: 1

I was wonder if there's any params I didn't get right for the launchw.sh?

This was the command found in launchw.sh:

python webshop.py --log-dir logs\
				  --observation-mode text_rich\
				  --load-replay history-pools/init_pool.wq.yaml\
				  --load-replay history-pools/init_pool.wq.yaml\
				  --save-replay history-pools/init_pool.wqu."$date_str".%d.a.yaml\
				  --save-replay history-pools/init_pool.wqu."$date_str".%d.b.yaml\
				  --item-capacity 500\
				  --action-capacity 20\
				  --matcher pgpat+insrel\
				  --prompt-template prompts/\
				  --max-tokens 200 \
				  --stop "Discouraged" \
				  --request-timeout 10.\
				  --starts-from 0\
				  --epochs 3\
				  --trainseta 0\
				  --trainsetb 10\
				  --testseta 0\
				  --testsetb 100

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions