AMR parsing gets some numbers wrong #1721

@SoaringTiger

Describe the bug
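Example 1: 我给了他15万元。 ("I gave him 150,000 yuan.")
The AMR parse result is shown in the screenshot below:
[Screenshot: bug]
“15万” is not parsed correctly.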


Example 2: 我给了他十五点八万元。 ("I gave him 158,000 yuan.")
[Screenshot: bug2]

“十五点八万” is not parsed correctly.


Example 3: 我给了他十元三角八分钱。 ("I gave him 10 yuan, 3 jiao, and 8 fen.")
[Screenshot: 2022-04-15, 5:54:34 PM]
“十元三角八分” is not parsed correctly.

Code to reproduce the issue
Provide a reproducible test case that is the bare minimum necessary to generate the problem.

Describe the current behavior
After changing “15万” to “十五万”, the value is parsed as “150000”.
The error most likely comes from the number-conversion step. For reference, see https://github.com/microsoft/Recognizers-Text
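
To make the expected conversion concrete, here is a minimal sketch of a normalizer that handles the three inputs above: Arabic digits mixed with 万, a 点 decimal, and 元/角/分 amounts. This is not HanLP's or Recognizers-Text's implementation. Every name in it (`parse_number`, `parse_money`, the lookup tables) is hypothetical, and it assumes at most one 亿 and one 万 per number, so nested units such as 一万亿 are out of scope.

```python
from decimal import Decimal

# Illustrative lookup tables; none of these names come from HanLP.
DIGITS = {"零": 0, "〇": 0, "一": 1, "二": 2, "两": 2, "三": 3, "四": 4,
          "五": 5, "六": 6, "七": 7, "八": 8, "九": 9}
SMALL_UNITS = {"十": 10, "百": 100, "千": 1000}
BIG_UNITS = (("亿", 10**8), ("万", 10**4))
MONEY_UNITS = (("元", Decimal("1")), ("角", Decimal("0.1")), ("分", Decimal("0.01")))


def _chinese_integer(text: str) -> int:
    """Parse a Chinese integer below 10,000, e.g. '十五' -> 15."""
    total, current = 0, 0
    for ch in text:
        if ch in DIGITS:
            current = DIGITS[ch]
        elif ch in SMALL_UNITS:
            total += (current or 1) * SMALL_UNITS[ch]  # a bare '十' means 10
            current = 0
        else:
            raise ValueError(f"unexpected character {ch!r} in {text!r}")
    return total + current


def _section(text: str) -> Decimal:
    """Parse one section without 万/亿, written in Arabic or Chinese digits."""
    int_part, _, frac_part = text.replace("点", ".").partition(".")
    if int_part.isascii() and int_part.isdigit():
        integer = int(int_part)
    else:
        integer = _chinese_integer(int_part)
    # Digits after the decimal point are read one by one.
    frac = "".join(str(DIGITS.get(ch, ch)) for ch in frac_part) or "0"
    return Decimal(f"{integer}.{frac}")


def parse_number(text: str) -> Decimal:
    """Parse a number such as '15万', '十五万' or '十五点八万'."""
    total, rest = Decimal(0), text
    for unit, value in BIG_UNITS:
        if unit in rest:
            head, rest = rest.split(unit, 1)
            total += _section(head) * value
    return total + (_section(rest) if rest else 0)


def parse_money(text: str) -> Decimal:
    """Parse an amount in yuan such as '十元三角八分'; trailing text is ignored."""
    total, rest = Decimal(0), text
    for unit, value in MONEY_UNITS:
        if unit in rest:
            head, rest = rest.split(unit, 1)
            total += parse_number(head) * value
    return total


if __name__ == "__main__":
    for amount in ("15万元", "十五点八万元", "十元三角八分钱"):
        print(amount, parse_money(amount))
```

With this sketch, the three amounts from the examples come out numerically equal to 150000, 158000, and 10.38. `Decimal` is used instead of `float` so that values like 15.8 × 10000 stay exact.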

Expected behavior
The label should show the correct value.
That said, looking at the output data, the anchors preserve the position in the original text, so the problem is not that serious 😄
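
As a workaround, the original span can be recovered from the anchors. The sketch below assumes the parser returns MRP-style nodes whose `anchors` are lists of `{"from": ..., "to": ...}` character offsets into the input sentence. The node shown is a hand-written, hypothetical example rather than real HanLP output, and `anchored_text` is a helper name made up for illustration.

```python
def anchored_text(sentence: str, node: dict) -> str:
    """Return the surface text that a node's anchors point at."""
    return "".join(sentence[a["from"]:a["to"]] for a in node.get("anchors", []))


sentence = "我给了他15万元。"
# Hypothetical node: the label is wrong, but the anchors still cover "15万".
node = {"id": 3, "label": "<mis-parsed label>", "anchors": [{"from": 4, "to": 7}]}

print(anchored_text(sentence, node))
```

The recovered string can then be passed to a separate number normalizer when the label itself is unreliable.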

System information

  • OS Platform and Distribution (Linux Ubuntu 16.04):
  • Python version: 3.9
  • HanLP version: 2.1b23

Other info / logs
Include any logs or source code that would be helpful to diagnose the problem. If including tracebacks, please include the full traceback. Large logs and files should be attached.

  • I've completed this form and searched the web for solutions.
