跳到主要内容

ai-prompt-guard

ai-prompt-guard 插件通过检查和验证传入的提示词消息来保护你的 LLM 端点。它根据用户定义的允许和拒绝模式检查请求内容,确保只有批准的输入被转发到上游 LLM。根据配置,该插件可以仅检查最新消息或整个对话历史记录,并且可以设置为检查所有角色的提示词或仅检查最终用户的提示词。

当同时配置了 allow_patternsdeny_patterns 时,插件首先确保至少匹配一个 allow_patterns。如果没有匹配,请求将被拒绝。如果匹配了允许模式,它会接着检查是否出现了拒绝模式。

示例

以下示例将使用 OpenAI 作为上游服务提供商。在开始之前,请创建一个 OpenAI 账号 并获取 API Key。你可以选择将密钥保存到环境变量中,如下所示:

export OPENAI_API_KEY=<YOUR_OPENAI_API_KEY>   # 替换为你的 API 密钥

如果你使用其他 LLM 提供商,请参考该提供商的文档获取 API Key。

实现允许和拒绝模式

以下示例演示了如何使用 ai-prompt-guard 插件通过定义允许和拒绝模式来验证用户提示词,并理解允许模式的优先级。

定义允许和拒绝模式。你可以选择将它们保存到环境变量中以便更容易转义:

# 允许美元金额
export ALLOW_PATTERN_1='\\$?\\(?\\d{1,3}(,\\d{3})*(\\.\\d{1,2})?\\)?'
# 拒绝美国格式的电话号码
export DENY_PATTERN_1='(\\([0-9]{3}\\)|[0-9]{3}-)[0-9]{3}-[0-9]{4}'

创建一个使用 ai-proxy 代理到 OpenAI 并使用 ai-prompt-guard 检查输入提示词的路由:

curl "http://127.0.0.1:9180/apisix/admin/routes" -X PUT \
-H "X-API-KEY: ${ADMIN_API_KEY}" \
-d '{
"id": "ai-prompt-guard-route",
"uri": "/anything",
"methods": ["POST"],
"plugins": {
"ai-proxy": {
// Annotate 1
"provider": "openai",
"auth": {
"header": {
// Annotate 2
"Authorization": "Bearer '"$OPENAI_API_KEY"'"
}
},
"options":{
// Annotate 3
"model": "gpt-4"
}
},
"ai-prompt-guard": {
"allow_patterns": [
// Annotate 5
"'"$ALLOW_PATTERN_1"'"
],
"deny_patterns": [
// Annotate 6
"'"$DENY_PATTERN_1"'"
]
}
}
}'

❶ 指定提供商为 openai

❷ 在 api-key 头中附带 OpenAI API Key。

❸ 指定模型名称。

❹ 允许匹配任何美元金额的消息模式。

❺ 拒绝包含任何美国电话号码格式的消息模式。

向该路由发送请求以评价一次购买的公允性:

curl -i "http://127.0.0.1:9080/anything" -X POST \
-H "Content-Type: application/json" \
-d '{
"messages": [
{ "role": "system", "content": "Rate if the purchase is at a decent price in USD." },
{ "role": "user", "content": "John paid $12.5 for a hot brewed coffee in El Paso." }
]
}'

你应该看到收到类似以下的 HTTP/1.1 200 OK 响应:

{
...
"model": "gpt-4-0613",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "The purchase is not at a decent price. Typically, a hot brewed coffee costs anywhere from $1 to $3 in most places in the US, so $12.5 is quite expensive.",
"refusal": null
},
"logprobs": null,
"finish_reason": "stop"
}
],
...
}

向该路由发送另一个请求,消息中不包含任何价格:

curl -i "http://127.0.0.1:9080/anything" -X POST \
-H "Content-Type: application/json" \
-d '{
"messages": [
{ "role": "system", "content": "Rate if the purchase is at a decent price in USD." },
{ "role": "user", "content": "John paid a bit for a hot brewed coffee in El Paso." }
]
}'

你应该收到 HTTP/1.1 400 Bad Request 响应,并看到以下消息:

{"message":"Request doesn't match allow patterns"}

向该路由发送第三个请求,消息中包含电话号码:

curl -i "http://127.0.0.1:9080/anything" -X POST \
-H "Content-Type: application/json" \
-d '{
"messages": [
{ "role": "system", "content": "Rate if the purchase is at a decent price in USD." },
{ "role": "user", "content": "John (647-200-9393) paid $12.5 for a hot brewed coffee in El Paso." }
]
}'

你应该收到 HTTP/1.1 400 Bad Request 响应,并看到以下消息:

{"message":"Request contains prohibited content"}

默认情况下,插件仅检查 user 角色的输入和最后一条消息。例如,如果你发送包含禁止内容的 system 提示词请求:

curl -i "http://127.0.0.1:9080/anything" -X POST \
-H "Content-Type: application/json" \
-d '{
"messages": [
{ "role": "system", "content": "Rate if the purchase from 647-200-9393 is at a decent price in USD." },
{ "role": "user", "content": "John paid $12.5 for a hot brewed coffee in El Paso." }
]
}'

你将收到 HTTP/1.1 200 OK 响应。

如果你发送的请求中,倒数第二条消息包含禁止内容:

curl -i "http://127.0.0.1:9080/anything" -X POST \
-H "Content-Type: application/json" \
-d '{
"messages": [
{ "role": "system", "content": "Rate if the purchase is at a decent price in USD." },
{ "role": "user", "content": "Customer John contact: 647-200-9393" },
{ "role": "user", "content": "John paid $12.5 for a hot brewed coffee in El Paso." }
]
}'

你也将收到 HTTP/1.1 200 OK 响应。

查看 下一个示例,了解如何检查所有角色的消息和所有消息。

验证所有角色的消息和对话历史

以下示例演示了如何使用 ai-prompt-guard 插件验证所有角色的提示词(例如 systemuser),并验证整个对话历史记录而不仅仅是最后一条消息。

定义允许和拒绝模式。你可以选择将它们保存到环境变量中以便更容易转义:

export ALLOW_PATTERN_1='\\$?\\(?\\d{1,3}(,\\d{3})*(\\.\\d{1,2})?\\)?'
export DENY_PATTERN_1='(\\([0-9]{3}\\)|[0-9]{3}-)[0-9]{3}-[0-9]{4}'

创建一个使用 ai-proxy 代理到 OpenAI 并使用 ai-prompt-guard 检查输入提示词的路由:

curl "http://127.0.0.1:9180/apisix/admin/routes" -X PUT \
-H "X-API-KEY: ${ADMIN_API_KEY}" \
-d '{
"id": "ai-prompt-guard-route",
"uri": "/anything",
"methods": ["POST"],
"plugins": {
"ai-proxy": {
"provider": "openai",
"auth": {
"header": {
"Authorization": "Bearer '"$OPENAI_API_KEY"'"
}
},
"options":{
"model": "gpt-4"
}
},
"ai-prompt-guard": {
// Annotate 1
"match_all_roles": true,
// Annotate 2
"match_all_conversation_history": true,
"allow_patterns": [
"'"$ALLOW_PATTERN_1"'"
],
"deny_patterns": [
"'"$DENY_PATTERN_1"'"
]
}
}
}'

❶ 验证所有角色的消息。

❷ 验证整个对话的消息。

发送一个请求,其中 system 提示词包含禁止内容:

curl -i "http://127.0.0.1:9080/anything" -X POST \
-H "Content-Type: application/json" \
-d '{
"messages": [
{ "role": "system", "content": "Rate if the purchase from 647-200-9393 is at a decent price in USD." },
{ "role": "user", "content": "John paid $12.5 for a hot brewed coffee in El Paso." }
]
}'

你应该收到 HTTP/1.1 400 Bad Request 响应,并看到以下消息:

{"message":"Request contains prohibited content"}

发送一个请求,其中同一角色的多条消息包含禁止内容:

curl -i "http://127.0.0.1:9080/anything" -X POST \
-H "Content-Type: application/json" \
-d '{
"messages": [
{ "role": "system", "content": "Rate if the purchase is at a decent price in USD." },
{ "role": "user", "content": "Customer John contact: 647-200-9393" },
{ "role": "user", "content": "John paid $12.5 for a hot brewed coffee in El Paso." }
]
}'

你应该收到 HTTP/1.1 400 Bad Request 响应,并看到以下消息:

{"message":"Request contains prohibited content"}

发送一个符合模式的请求:

curl -i "http://127.0.0.1:9080/anything" -X POST \
-H "Content-Type: application/json" \
-d '{
"messages": [
{ "role": "system", "content": "Rate if the purchase is at a decent price in USD." },
{ "role": "system", "content": "The puchase is made in El Paso." },
{ "role": "user", "content": "Customer John contact: xxx-xxx-xxxx" },
{ "role": "user", "content": "John paid $12.5 for a hot brewed coffee." }
]
}'

你应该看到收到类似以下的 HTTP/1.1 200 OK 响应:

{
...,
"model": "gpt-4-0613",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "$12.5 is generally considered quite expensive for a cup of brew coffee.",
"refusal": null
},
"logprobs": null,
"finish_reason": "stop"
}
],
...
}