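Image Recognition

Uses a multimodal model to recognize the content of an image and returns a text description or answer.

This interface is compatible with Chat Completions and uses the same endpoint: POST /v1/chat/completions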

POST https://api-platform.ope.ai/v1/chat/completions

Authentication

Authentication uses a Bearer token.

  • Header: Authorization: Bearer <token>
  • Example: Authorization: Bearer sk-xxxxxx

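Request body (application/json)

Field              Type           Required  Description                                               Default / Range
model              string                   Model ID (multimodal/vision model)                        e.g. Image-Recognition
messages           array<object>            List of conversation messages (mixed image/text content)  -
max_tokens         integer                  Maximum number of output tokens                           -
temperature        number                   Sampling temperature                                      Default 1; range 0~2
top_p              number                   Nucleus sampling parameter                                Default 1; range 0~1
top_k              integer                  Top-K sampling parameter                                  -
frequency_penalty  number                   Frequency penalty                                         Default 0; range -2~2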
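
The documented ranges can be checked on the client before a request is sent. The following is a minimal sketch rather than part of the API: `build_payload` and `_RANGES` are hypothetical names, and only the ranges listed in the table are enforced (fields with no documented range are passed through unchanged).

```python
from typing import Any

# Ranges copied from the parameter table; fields without a documented range are not checked.
_RANGES = {
    "temperature": (0.0, 2.0),
    "top_p": (0.0, 1.0),
    "frequency_penalty": (-2.0, 2.0),
}


def build_payload(model: str, messages: list[dict[str, Any]], **options: Any) -> dict[str, Any]:
    """Assemble a request body, rejecting values outside the documented ranges."""
    payload: dict[str, Any] = {"model": model, "messages": messages}
    for name, value in options.items():
        if value is None:
            continue  # leave unset options out so the server-side defaults apply
        if name in _RANGES:
            low, high = _RANGES[name]
            if not low <= value <= high:
                raise ValueError(f"{name} must be between {low} and {high}, got {value}")
        payload[name] = value
    return payload
```

For example, `build_payload("Image-Recognition", messages, max_tokens=1024, temperature=0.2)` yields a body with those four keys, while `top_p=1.5` raises `ValueError`.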

messages.content (mixed image and text)

messages[].content can be an array. The two common item types are:

  • Image:
    • {"type":"image_url","image_url":{"url":"data:image/jpeg;base64,<BASE64_IMAGE>"}}
  • Text:
    • {"type":"text","text":"What is this?"}

image_url.url can be either the URL of an image on the web or a Data URL (for example data:image/jpeg;base64,...).
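
Building the content array from local image bytes can be sketched as follows. This is an illustration under assumptions: `image_part`, `text_part`, and `user_message` are hypothetical helper names, and the MIME type is guessed from the file name with a JPEG fallback.

```python
import base64
import mimetypes


def image_part(data: bytes, filename: str = "image.jpg") -> dict:
    """Wrap raw image bytes as an image_url item carrying a Data URL."""
    mime, _ = mimetypes.guess_type(filename)
    if mime is None or not mime.startswith("image/"):
        mime = "image/jpeg"  # assumption: fall back to JPEG when the type is unknown
    encoded = base64.b64encode(data).decode("ascii")
    return {"type": "image_url", "image_url": {"url": f"data:{mime};base64,{encoded}"}}


def text_part(text: str) -> dict:
    """Wrap a question or instruction as a text item."""
    return {"type": "text", "text": text}


def user_message(image: bytes, question: str, filename: str = "image.jpg") -> dict:
    """Build one user message with the image first and the text second."""
    return {"role": "user", "content": [image_part(image, filename), text_part(question)]}
```

A call such as `user_message(open("photo.jpg", "rb").read(), "What is this?", "photo.jpg")` produces a message in the same shape as the items listed above.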

Request example

The domain below is an example: https://api-platform.ope.ai

curl -X POST "https://api-platform.ope.ai/v1/chat/completions" \
  -H "Authorization: Bearer $OPEAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Image-Recognition",
    "max_tokens": 1024,
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "image_url",
            "image_url": {
              "url": "data:image/jpeg;base64,/9j/4AAQSkZJRgABAQAAAQABA...(truncated)"
            }
          },
          { "type": "text", "text": "What is this?" }
        ]
      }
    ]
  }'
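
The same request can be sent from Python using only the standard library. This is a rough sketch, not an official client: `API_URL`, `build_request`, and `send` are hypothetical names, the URL uses the example domain from this page, and the 60-second timeout is an arbitrary assumption.

```python
import json
import urllib.request

# Example domain from this page; replace it with the real base URL in use.
API_URL = "https://api-platform.ope.ai/v1/chat/completions"


def build_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Prepare the POST request with Bearer authentication and a JSON body."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    # urlopen raises urllib.error.HTTPError for 4xx/5xx status codes.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

With a payload shaped like the curl body, `send(build_request(os.environ["OPEAI_API_KEY"], payload))` (after `import os`) returns the decoded response as a dictionary.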

Response example

{
  "id": "string",
  "object": "chat.completion",
  "created": 0,
  "model": "string",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "system",
        "content": "string",
        "name": "string",
        "tool_calls": [
          {
            "id": "string",
            "type": "function",
            "function": { "name": "string", "arguments": "string" }
          }
        ],
        "tool_call_id": "string",
        "reasoning_content": "string"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 0,
    "completion_tokens": 0,
    "total_tokens": 0,
    "prompt_tokens_details": {
      "cached_tokens": 0,
      "text_tokens": 0,
      "audio_tokens": 0,
      "image_tokens": 0
    },
    "completion_tokens_details": {
      "text_tokens": 0,
      "audio_tokens": 0,
      "reasoning_tokens": 0
    }
  },
  "system_fingerprint": "string"
}
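
Reading the answer out of a response shaped like the example above might look like this. It is a sketch that assumes the response follows that shape; `extract_answer` is a hypothetical helper, and treating a missing `usage` block as zero tokens is an assumption.

```python
def extract_answer(response: dict) -> tuple[str, int]:
    """Return the first choice's text and the total token count (0 if absent)."""
    choices = response.get("choices") or []
    if not choices:
        raise ValueError("response contains no choices")
    # content may be missing or null, for example when only tool_calls are returned
    content = choices[0].get("message", {}).get("content") or ""
    total_tokens = (response.get("usage") or {}).get("total_tokens", 0)
    return content, total_tokens
```

The function reads only `choices[0].message.content` and `usage.total_tokens`; the other fields in the example are ignored.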