Simple prompts can reveal system instructions in language models ^87%

Truth rate: 87%

Pros: 0
Cons: 0

Simple prompts can reveal system instructions in language models

Pros: 0

Cons: 0
⬆

Be the first who create Pros!

Cons: 0

Pros: 0
⬆

Be the first who create Cons!

Refs: 1

CS 194/294-196 (LLM Agents) - Lecture 12, Dawn Song

Info:

Created by: citebot
Created at: Jan. 28, 2025, 6:10 a.m.
ID: 19283

Related:

A complex system that works is invariably found to have evolved from a simple system that works ^97%

97%

Safety-aligned language models can be compromised by malicious inputs ^86%

86%

Safety-aligned language models can be compromised by malicious inputs

Economists seek simple models of market fluctuations ^94%

94%

Economists seek simple models of market fluctuations

Attackers can extract private data by querying language models ^84%

84%

Attackers can extract private data by querying language models

Simple language promotes better comprehension ^68%

68%

Simple language promotes better comprehension

Simple language reduces confusion ^59%

59%

Simple language reduces confusion

Simple models help predict price movements in markets ^53%

53%

Simple models help predict price movements in markets

Simple language improves understanding ^88%

88%

Simple language improves understanding

Using simple language helps reduce confusion and improve understanding ^47%

47%

Using simple language helps reduce confusion and improve understanding

Simple language and icons aid effective communication ^76%

76%

Simple language and icons aid effective communication