文字检测
能力说明¶
本能力用于检测照片中出现的文字
方法标识¶
OCR
请求示例¶
请求参数示例:
{
"uri": "http://pp.imgo.tv/upload/test/ocr.png"
}
返回结果示例:
{
"action": "OCR",
"code": 200,
"data": {
"debug": "http://s3.mediacloud.imgo.tv/open-ai/2023/2/6/7a76ef65efa6407c8c7159c961032c6b/7a76ef65efa6407c8c7159c961032c6b.jpg?bucket=open-ai®ion=cn-changsha-1",
"result": [
{
"coord": [
{
"x": 24,
"y": 17
},
{
"x": 193,
"y": 17
},
{
"x": 193,
"y": 38
},
{
"x": 24,
"y": 38
}
],
"score": 0.9927455186843872,
"label": "智能媒体云平台"
},
{
"coord": [
{
"x": 50,
"y": 63
},
{
"x": 218,
"y": 63
},
{
"x": 218,
"y": 84
},
{
"x": 50,
"y": 84
}
],
"score": 0.9518229365348816,
"label": "·AI开放能力梳理"
},
{
"coord": [
{
"x": 47,
"y": 109
},
{
"x": 265,
"y": 109
},
{
"x": 265,
"y": 129
},
{
"x": 47,
"y": 129
}
],
"score": 0.9346679449081421,
"label": "》原子能力(媒体处理"
},
{
"coord": [
{
"x": 46,
"y": 151
},
{
"x": 158,
"y": 151
},
{
"x": 158,
"y": 175
},
{
"x": 46,
"y": 175
}
],
"score": 0.8736327886581421,
"label": "〉镜像管理"
}
]
},
"msg": "success",
"time": 1675653798158,
"requestId": "7a76ef65efa6407c8c7159c961032c6b"
}
字段说明:
debug: 为调试图片文件,框选识别的文字信息并可视化展示(如下图所示);
result: 为识别的文字信息,coord字段为识别区域4个点的坐标,score为准确度,label为识别的文字结果。