sabiland
sabiland

Reputation: 2614

Current status of NLP API's (Api.ai, Wit.ai, Luis, etc.)?

The company where I work needs to choose one of the available NLP API's providers. For the POC I've integrated Api.Ai and I must say it works really good. Console UI is smooth and easy to work with. The same goes for REST API's.

Though I have to check and compare also other providers (Luis, Wit.ai, etc.)

Is there any recent "study" how the major AI providers compare nowadays (support, quality, etc.)? Everything info available I've found (about comparisons) was kind of outdated.

I've checked Microsoft's Luis -> looks like a copy (UI, etc.) of Api.Ai.

People are saying Wit.ai (Facebook) console is a mess to work with, etc.

There are +50 NLP API providers out there - it's hard to check every single one of them.

I would be very thankful if anyone has some recent comparison-infos about major NLP providers.

EDIT: We have decided for API.AI (DialogFlow since Google purchased it). I must say DialogFlow works amazingly. Latest V2 API works over gRPC.

Upvotes: 2

Views: 619

Answers (1)

savenkov
savenkov

Reputation: 668

We have recently published an evaluation study of seven NLU API-enabled services: API.ai, Amazon Lex, Microsoft LUIS, IBM Watson Conversation, wit.ai, Recast.ai or Snips.ai.

Briefly speaking:

  • IBM Watson intent detection is the best one, especially on smaller training datasets (although when trained on over 2000 samples the difference is indistinguishable).
  • API.AI is free, the performance on big enough training set matches IBM Watson and Microsoft LUIS.
  • Microsoft LUIS works significantly faster than others in our tests.
  • wit.ai has somewhat worse performance and response time than the three above, but it’s free and it provides the best language coverage (some 50 languages).
  • Amazon Lex has quite strict API limits (the training set size is limited to 200K symbols, which may be insufficient to reach a good intent detection quality for a multi-intent assistant; also it requires all training utterances to be labeled by entities, which complicated preparation of the dataset.

Upvotes: 2

Related Questions