AI Safety Diary: September 7, 2025
A diary entry on advanced prompt engineering techniques and the faithfulness of LLM self-explanations for commonsense tasks.
A diary entry on advanced prompt engineering techniques and the faithfulness of LLM self-explanations for commonsense tasks.