[
https://issues.apache.org/jira/browse/OFBIZ-7954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16848198#comment-16848198 ]
Pierre Smits commented on OFBIZ-7954:
-------------------------------------
It seems to me that we're talking about a new feature. At least using new components (tika?) or enhancing the solr component. The ticket should reflect this better.
> Add NLP function to parse text content
> --------------------------------------
>
> Key: OFBIZ-7954
> URL:
https://issues.apache.org/jira/browse/OFBIZ-7954> Project: OFBiz
> Issue Type: Improvement
> Affects Versions: Trunk
> Reporter: Shi Jinghai
> Assignee: Shi Jinghai
> Priority: Minor
> Labels: NER, NLP
>
> NLP(Natural Language Processing) is an amazing tech can help us to improve our beloved OFBiz.
> NLP can be used to train your own model to parse product attributes and understand what it is, before importing the product, or used as an online CRM text robot to answer simple questions.
> We will try to add a new nlp component, and we hope we can get some help from the community.
> AFAIK, two apache projects are working on NLP:
>
http://tika.apache.org/>
http://opennlp.apache.org/> The Stanford NLP is used in tika:
>
http://nlp.stanford.edu/> While implementing this component, we found an ebay paper on this topic is very useful:
>
http://aclweb.org/anthology/D/D11/D11-1144.pdf> Kind Regards,
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)