Commit 8e2637f

Alexandre Oliveira and claude committed
docs(turing): clean up administration guide - remove NLP, NER, Chatbot, Blazon
- Remove old architecture diagram (superseded by architecture-overview.md)
- Remove NLP section (OpenNLP, OTCA, CoreNLP, SpaCy, Polyglot, GCP NLP); NLP removed from Turing ES
- Remove Named Entity Recognition (NER) section
- Remove Chatbot section (Agent, Intent, Actions, Entity, Training, History)
- Remove OpenText Blazon Integration section
- Update intro paragraph to reflect current feature set (Semantic Navigation + GenAI)
- Add reference to architecture-overview.md
- Remove NLP Vendor and NLP Instance from SN Site settings tables
- Remove NLP/NER columns from Fields table
- Update Connectors section to reference Viglet Dumont DEP

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
1 parent c035789 commit 8e2637f

1 file changed

Lines changed: 4 additions & 176 deletions

docs-turing/administration-guide.md

@@ -6,105 +6,9 @@ description: Viglet Turing ES Administration Guide

 # Viglet Turing ES: Administration Guide

-Viglet Turing ES ([https://viglet.com/turing](https://viglet.com/turing)) is an open source solution ([https://github.com/openviglet](https://github.com/openviglet)), which has Semantic Navigation and Chatbot as its main features. You can choose from several NLPs to enrich the data. All content is indexed in Solr as search engine.
+Viglet Turing ES ([https://viglet.com/turing](https://viglet.com/turing)) is an open source enterprise search platform ([https://github.com/openviglet](https://github.com/openviglet)) with Semantic Navigation and Generative AI as its main features. All content is indexed in Apache Solr as the primary search engine.

-## Architecture
-
-![Turing ES Architecture](/img/turing-diagram.png)
-
-## NLP
-
-Turing support the following providers:
-
-### OpenNLP
-
-Apache OpenNLP is a machine learning based toolkit for the processing of natural language text.
-
-Website: [https://opennlp.apache.org/](https://opennlp.apache.org/)
-
-| Attribute | Description |
-|-----------|-------------|
-| Title | OpenNLP Title |
-| Description | Description of OpenNLP |
-| Vendor | Select OpenNLP |
-| Endpoint URL | This attribute is not used |
-| Key | This attribute is not used |
-| Enabled | If the OpenNLP is enabled |
-
-### OpenText Content Analytics
-
-It transforms data into insights for better decision-making and information management while freeing up resources and time.
-
-Website: [https://www.opentext.com/](https://www.opentext.com/)
-
-| Attribute | Description |
-|-----------|-------------|
-| Title | OTCA Title |
-| Description | Description of OTCA |
-| Vendor | Select OTCA |
-| Endpoint URL | Service URL, for example: `http://localhost:40000` |
-| Key | This attribute is not used |
-| Enabled | If the OTCA is enabled |
-
-### CoreNLP
-
-CoreNLP is your one stop shop for natural language processing in Java! CoreNLP enables users to derive linguistic annotations for text, including token and sentence boundaries, parts of speech, named entities, numeric and time values, dependency and constituency parses, coreference, sentiment, quote attributions, and relations. CoreNLP currently supports 6 languages: Arabic, Chinese, English, French, German, and Spanish.
-
-Website: [https://stanfordnlp.github.io/CoreNLP/](https://stanfordnlp.github.io/CoreNLP/)
-
-| Attribute | Description |
-|-----------|-------------|
-| Title | CoreNLP Title |
-| Description | Description of CoreNLP |
-| Vendor | Select CoreNLP |
-| Endpoint URL | Service URL, for example: `http://localhost:9001` |
-| Key | This attribute is not used |
-| Enabled | If the CoreNLP is enabled |
-
-### SpaCy
-
-It is a free open-source library for Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
-
-Website: [https://spacy.io](https://spacy.io)
-
-| Attribute | Description |
-|-----------|-------------|
-| Title | SpaCy Title |
-| Description | Description of SpaCy |
-| Vendor | Select SpaCy |
-| Endpoint URL | Service URL, for example: `http://localhost:2800` |
-| Key | This attribute is not used |
-| Enabled | If the SpaCy is enabled |
-
-### Polyglot NLP
-
-Polyglot is a natural language pipeline that supports massive multilingual applications.
-
-Website: [https://polyglot.readthedocs.io](https://polyglot.readthedocs.io)
-
-| Attribute | Description |
-|-----------|-------------|
-| Title | Polyglot Title |
-| Description | Description of Polyglot |
-| Vendor | Select Polyglot |
-| Endpoint URL | Service URL, for example: `http://localhost:2810` |
-| Key | This attribute is not used |
-| Enabled | If the Polyglot is enabled |
-
-### Google Cloud Platform NLP
-
-Analyze text with ES using pre-trained API or custom AutoML machine learning models to extract relevant entities, understand sentiment, and more.
-
-Website: [https://cloud.google.com/natural-language](https://cloud.google.com/natural-language)
-
-| Attribute | Description |
-|-----------|-------------|
-| Title | GCP NLP Title |
-| Description | Description of GCP NLP |
-| Vendor | Select Google Cloud Platform NLP |
-| Endpoint URL | Service URL, for example: `https://language.googleapis.com/v1/documents:analyzeEntities` |
-| Key | Key of Google Cloud Platform NLP |
-| Enabled | If the GCP NLP is enabled |
+For an overview of the system architecture and component interactions, see [Architecture Overview](./architecture-overview.md).

 ## Documents and OCR

@@ -114,46 +18,7 @@ It can read PDFs and Documents and convert to plain text and also it uses OCR to

 ### Connectors

-Semantic Navigation uses Connectors to index the content from many sources.
-
-#### Apache Nutch
-
-Plugin for Apache Nutch to index content using crawler.
-
-Learn more at [Connectors](./connectors#apache-nutch)
-
-#### Database
-
-Command line that uses the same concept as sqoop ([https://sqoop.apache.org/](https://sqoop.apache.org/)), to create complex queries and map attributes to index based on the result.
-
-Learn more at [Connectors](./connectors#database)
-
-#### File System
-
-Command line to index files, extracting text from files such as Word, Excel, PDF, including images, through OCR.
-
-Learn more at [Connectors](./connectors#file-system)
-
-#### OpenText WEM Listener
-
-OpenText WEM Listener to publish content to Viglet Turing.
-
-#### Wordpress
-
-Wordpress plugin that allows you to index posts.
-
-Learn more at [Connectors](./connectors#wordpress)
-
-### Named Entity Recognition (NER)
-
-With NLP it is possible to detect entities such as:
-
-- People
-- Places
-- Organizations
-- Money
-- Time
-- Percentage
+Content is delivered to Turing ES by **Viglet Dumont DEP**, a separate application that manages connectors independently. Dumont DEP sends documents to Turing ES via the REST API. Available connectors include WebCrawler (Apache Nutch), Database, FileSystem, AEM/WEM, and WordPress. Refer to the [Dumont DEP documentation](/dumont) for connector configuration.

 ### Facets

@@ -167,41 +32,9 @@ Through attributes defined in the contents, it is possible to use them to restri

 Java API ([https://github.com/openviglet/turing-java-sdk](https://github.com/openviglet/turing-java-sdk)) facilitates the use and access to Viglet Turing ES, without the need for consumer search content with complex queries.

-## Chatbot
-
-Communicate with your client and elaborate complex intents, obtain reports and progressively evolve your interaction.
-
-### Agent
-
-Handles conversations with your end users. It is a natural language processing module that understands the nuances of human language.
-
-### Intent
-
-An intent categorizes an end user's intention for taking a conversation shift. For each agent, you define several intents, where your combined intents can handle a complete conversation.
-
-### Actions
-
-The field of action is a simple field of convenience that helps to execute logic in the service.
-
-### Entity
-
-Each intent parameter has a type, called an entity type, that dictates exactly how the data in an end user expression is extracted.
-
-### Training
-
-Defines and corrects intents.
-
-### History
-
-Shows the conversation history and reports.
-
-## OpenText Blazon Integration
-
-Turing ES detects Entities of OpenText Blazon Documents using OCR and NLP, generating Blazon XML to show the entities into document.
-
 ## Turing ES Console

-Turing ES has many components: Search Engine, NLP, Converse (Chatbot), Semantic Navigation.
+Turing ES has many components: Search Engine, Semantic Navigation, and Generative AI.

 ### Login

@@ -243,15 +76,13 @@ The Settings of Semantic Navigation Site contains the following attributes:
 | Name | Name of Semantic Navigation Site |
 | Description | Description of Semantic Navigation Site |
 | Search Engine | Select the Search Engine that was created in Search Engine Section |
-| NLP Vendor | NLP Vendor for this site |
 | Thesaurus | If will use Thesaurus |

 ##### Multi Languages Tab

 | Attribute | Description |
 |-----------|-------------|
 | Language | Language for Semantic Navigation Site |
-| NLP Instance | NLP Instance to detect entities during indexing |
 | Core | Solr Core Name to store and to search indexed content |

 ##### Behavior Tab
@@ -291,13 +122,11 @@ Fields Tab contains a table with the following columns:

 | Column Name | Description |
 |-------------|-------------|
-| Type | Type of Field: NER (Named Entity Recognition) or Search Engine (Solr) |
 | Field | Name of Field |
 | Enabled | If the field is enabled |
 | MLT | If this field will be used in MLT |
 | Facets | To use this field like a facet (filter) |
 | Highlighting | If this field will show highlighted lines |
-| NLP | If this field will be processed by NLP to detect Entities |

 When clicking a Field, a details page appears with:

@@ -314,7 +143,6 @@ When clicking a Field, a details page appears with:
 | Enabled | If the field is enabled |
 | Required | If the field is required |
 | Default Value | Default value if content is indexed without this field |
-| NLP | If this field will be processed by NLP |

 ##### Spotlight Details Tab
