docs-turing/administration-guide.md
4 additions & 176 deletions
@@ -6,105 +6,9 @@ description: Viglet Turing ES Administration Guide
 
 # Viglet Turing ES: Administration Guide
 
-Viglet Turing ES ([https://viglet.com/turing](https://viglet.com/turing)) is an open source solution ([https://github.com/openviglet](https://github.com/openviglet)), which has Semantic Navigation and Chatbot as its main features. You can choose from several NLPs to enrich the data. All content is indexed in Solr as search engine.
+Viglet Turing ES ([https://viglet.com/turing](https://viglet.com/turing)) is an open source enterprise search platform ([https://github.com/openviglet](https://github.com/openviglet)) with Semantic Navigation and Generative AI as its main features. All content is indexed in Apache Solr as the primary search engine.
 
-## Architecture
-
-
-
-## NLP
-
-Turing support the following providers:
-
-### OpenNLP
-
-Apache OpenNLP is a machine learning based toolkit for the processing of natural language text.
-| Endpoint URL | Service URL, for example: `http://localhost:40000`|
-| Key | This attribute is not used |
-| Enabled | If the OTCA is enabled |
-
-### CoreNLP
-
-CoreNLP is your one stop shop for natural language processing in Java! CoreNLP enables users to derive linguistic annotations for text, including token and sentence boundaries, parts of speech, named entities, numeric and time values, dependency and constituency parses, coreference, sentiment, quote attributions, and relations. CoreNLP currently supports 6 languages: Arabic, Chinese, English, French, German, and Spanish.
-| Endpoint URL | Service URL, for example: `https://language.googleapis.com/v1/documents:analyzeEntities`|
-| Key | Key of Google Cloud Platform NLP |
-| Enabled | If the GCP NLP is enabled |
+For an overview of the system architecture and component interactions, see [Architecture Overview](./architecture-overview.md).
 
 ## Documents and OCR
 
@@ -114,46 +18,7 @@ It can read PDFs and Documents and convert to plain text and also it uses OCR to
 
 ### Connectors
 
-Semantic Navigation uses Connectors to index the content from many sources.
-
-#### Apache Nutch
-
-Plugin for Apache Nutch to index content using crawler.
-
-Learn more at [Connectors](./connectors#apache-nutch)
-
-#### Database
-
-Command line that uses the same concept as sqoop ([https://sqoop.apache.org/](https://sqoop.apache.org/)), to create complex queries and map attributes to index based on the result.
-
-Learn more at [Connectors](./connectors#database)
-
-#### File System
-
-Command line to index files, extracting text from files such as Word, Excel, PDF, including images, through OCR.
-
-Learn more at [Connectors](./connectors#file-system)
-
-#### OpenText WEM Listener
-
-OpenText WEM Listener to publish content to Viglet Turing.
-
-#### Wordpress
-
-Wordpress plugin that allows you to index posts.
-
-Learn more at [Connectors](./connectors#wordpress)
-
-### Named Entity Recognition (NER)
-
-With NLP it is possible to detect entities such as:
-
-- People
-- Places
-- Organizations
-- Money
-- Time
-- Percentage
+Content is delivered to Turing ES by **Viglet Dumont DEP**, a separate application that manages connectors independently. Dumont DEP sends documents to Turing ES via the REST API. Available connectors include WebCrawler (Apache Nutch), Database, FileSystem, AEM/WEM, and WordPress. Refer to the [Dumont DEP documentation](/dumont) for connector configuration.
 
 ### Facets
 
@@ -167,41 +32,9 @@ Through attributes defined in the contents, it is possible to use them to restri
 
 Java API ([https://github.com/openviglet/turing-java-sdk](https://github.com/openviglet/turing-java-sdk)) facilitates the use and access to Viglet Turing ES, without the need for consumer search content with complex queries.
 
-## Chatbot
-
-Communicate with your client and elaborate complex intents, obtain reports and progressively evolve your interaction.
-
-### Agent
-
-Handles conversations with your end users. It is a natural language processing module that understands the nuances of human language.
-
-### Intent
-
-An intent categorizes an end user's intention for taking a conversation shift. For each agent, you define several intents, where your combined intents can handle a complete conversation.
-
-### Actions
-
-The field of action is a simple field of convenience that helps to execute logic in the service.
-
-### Entity
-
-Each intent parameter has a type, called an entity type, that dictates exactly how the data in an end user expression is extracted.
-
-### Training
-
-Defines and corrects intents.
-
-### History
-
-Shows the conversation history and reports.
-
-## OpenText Blazon Integration
-
-Turing ES detects Entities of OpenText Blazon Documents using OCR and NLP, generating Blazon XML to show the entities into document.
-
 ## Turing ES Console
 
-Turing ES has many components: Search Engine, NLP, Converse (Chatbot), Semantic Navigation.
+Turing ES has many components: Search Engine, Semantic Navigation, and Generative AI.
 
 ### Login
 
@@ -243,15 +76,13 @@ The Settings of Semantic Navigation Site contains the following attributes:
 | Name | Name of Semantic Navigation Site |
 | Description | Description of Semantic Navigation Site |
 | Search Engine | Select the Search Engine that was created in Search Engine Section |
-| NLP Vendor | NLP Vendor for this site |
 | Thesaurus | If will use Thesaurus |
 
 ##### Multi Languages Tab
 
 | Attribute | Description |
 |-----------|-------------|
 | Language | Language for Semantic Navigation Site |
-| NLP Instance | NLP Instance to detect entities during indexing |
 | Core | Solr Core Name to store and to search indexed content |
 
 ##### Behavior Tab
@@ -291,13 +122,11 @@ Fields Tab contains a table with the following columns:
 
 | Column Name | Description |
 |-------------|-------------|
-| Type | Type of Field: NER (Named Entity Recognition) or Search Engine (Solr) |
 | Field | Name of Field |
 | Enabled | If the field is enabled |
 | MLT | If this field will be used in MLT |
 | Facets | To use this field like a facet (filter) |
 | Highlighting | If this field will show highlighted lines |
-| NLP | If this field will be processed by NLP to detect Entities |
 
 When clicking a Field, a details page appears with:
 
@@ -314,7 +143,6 @@ When clicking a Field, a details page appears with:
 | Enabled | If the field is enabled |
 | Required | If the field is required |
 | Default Value | Default value if content is indexed without this field |