跳转至

文字检测

能力说明

本能力用于检测照片中出现的文字

方法标识

OCR

请求示例

请求参数示例:

{
    "uri": "http://pp.imgo.tv/upload/test/ocr.png"
}

返回结果示例:

{
    "action": "OCR",
    "code": 200,
    "data": {
        "debug": "http://s3.mediacloud.imgo.tv/open-ai/2023/2/6/7a76ef65efa6407c8c7159c961032c6b/7a76ef65efa6407c8c7159c961032c6b.jpg?bucket=open-ai&region=cn-changsha-1",
        "result": [
            {
                "coord": [
                    {
                        "x": 24,
                        "y": 17
                    },
                    {
                        "x": 193,
                        "y": 17
                    },
                    {
                        "x": 193,
                        "y": 38
                    },
                    {
                        "x": 24,
                        "y": 38
                    }
                ],
                "score": 0.9927455186843872,
                "label": "智能媒体云平台"
            },
            {
                "coord": [
                    {
                        "x": 50,
                        "y": 63
                    },
                    {
                        "x": 218,
                        "y": 63
                    },
                    {
                        "x": 218,
                        "y": 84
                    },
                    {
                        "x": 50,
                        "y": 84
                    }
                ],
                "score": 0.9518229365348816,
                "label": "·AI开放能力梳理"
            },
            {
                "coord": [
                    {
                        "x": 47,
                        "y": 109
                    },
                    {
                        "x": 265,
                        "y": 109
                    },
                    {
                        "x": 265,
                        "y": 129
                    },
                    {
                        "x": 47,
                        "y": 129
                    }
                ],
                "score": 0.9346679449081421,
                "label": "》原子能力(媒体处理"
            },
            {
                "coord": [
                    {
                        "x": 46,
                        "y": 151
                    },
                    {
                        "x": 158,
                        "y": 151
                    },
                    {
                        "x": 158,
                        "y": 175
                    },
                    {
                        "x": 46,
                        "y": 175
                    }
                ],
                "score": 0.8736327886581421,
                "label": "〉镜像管理"
            }
        ]
    },
    "msg": "success",
    "time": 1675653798158,
    "requestId": "7a76ef65efa6407c8c7159c961032c6b"
}

字段说明:

debug: 为调试图片文件,框选识别的文字信息并可视化展示(如下图所示);

result: 为识别的文字信息,coord字段为识别区域4个点的坐标,score为准确度,label为识别的文字结果。