Artificial Intelligence

Artificial Intelligence, or simply AI, can be defined as a technology that enables machines to simulate human learning, comprehension, problem-solving, decision-making, creativity, and autonomy.

Beyond this introduction, there is no single, simple definition of Artificial Intelligence, because AI tools can perform tasks under varying and unpredictable circumstances without significant human oversight, and can learn from experience and improve their performance when exposed to data sets.

LogicalDOC contains a general-purpose AI engine that you can use to solve problems, even ones not strictly related to document management, while benefiting from the full capabilities of a Document Management System to manage the large volumes of data needed for training.

Models

AI models are programs that implement an algorithm designed to solve a problem in the way a human brain would. You can also think of them as artificial brains that enable systems to learn from data and perform tasks such as analysis, prediction, and content generation.

At the time of writing, LogicalDOC supports this set of models:

  • Neural Network: predicts the category or nature of an object from input data
  • Classifier: uses Natural Language Processing (NLP) to categorize natural-language text
  • Tokens Detector: uses Natural Language Processing (NLP) to extract tokens from natural-language text

Samplers

Models cannot do anything until they have been trained: like children, they must learn from experience in order to 'understand' how to solve a given problem.

In AI, this experience is built through a process called training, which presents the model with a large dataset of examples. The size and quality of the dataset affect the model's ability to identify patterns in the data and therefore to understand the problem.

Samplers are the objects responsible for retrieving the data used to train the models.

Samplers

A sampler is an object used to retrieve and prepare a dataset for the training of a model.

You manage samplers in Administration > Artificial Intelligence > Models > Samplers.

Several types of sampler are available, each with its own settings:

Sampler: csv

Description: Reads the contents of a CSV file, extracting each row as an array of strings.
The expected format of each resource is:

    5.1,3.5,1.4,.2,"Setosa"
    7,3.2,4.7,1.4,"Versicolor"
    6.2,3.4,5.4,2.3,"Virginica"

This example produces three rows of 5 elements each:

    5.1, 3.5, 1.4, .2, Setosa
    7, 3.2, 4.7, 1.4, Versicolor
    6.2, 3.4, 5.4, 2.3, Virginica

Settings:
  • Delimiter: the character used as the field delimiter
  • Quote: the character used to enclose the value of a field
  • Document: the CSV document that contains the data
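
As a rough illustration of what the csv sampler's Delimiter and Quote settings mean, the following Python sketch parses the sample rows shown above. It is not LogicalDOC code: the function name `read_samples` and its parameters are hypothetical, and it relies only on Python's standard `csv` module.

```python
import csv
import io


def read_samples(text, delimiter=",", quote='"'):
    """Parse CSV text into rows of strings, one row per sample.

    Hypothetical helper: 'delimiter' and 'quote' mirror the sampler's
    Delimiter and Quote settings described in the documentation.
    """
    reader = csv.reader(io.StringIO(text), delimiter=delimiter, quotechar=quote)
    # csv.reader yields an empty list for blank lines; skip those.
    return [row for row in reader if row]


data = (
    '5.1,3.5,1.4,.2,"Setosa"\n'
    '7,3.2,4.7,1.4,"Versicolor"\n'
    '6.2,3.4,5.4,2.3,"Virginica"\n'
)

rows = read_samples(data)
for row in rows:
    print(row)
```

Every element stays a string, quotes included in the source are stripped, and converting the numeric features is left to whatever consumes the rows.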

Sampler: paragraph

Description: Extracts the paragraphs, interpreted as blocks of text separated by blank lines.
The expected format of each resource is:

A colleague of mine told me that the document 12356897 contains very important information, so I want to get it. Understood, but are you registered as LogicalDOC's user? If you are a user, just access the interface and then execute a search by document id = 12356897.

Where can I locate a specific file? I was not able to find what I was looking for. Ok, just enter LogicalDOC and search for document with ID -96668429, it is very easy. Sure! Easy and quick, many thanks for your hint.

The example above will produce two paragraphs.

Settings:
  • Document: the text document that contains the data
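
The rule "blocks of text separated by blank lines" can be sketched in a few lines of Python. This is only an illustration of the splitting rule, not the sampler's actual implementation. The function name `split_paragraphs` is hypothetical, and collapsing whitespace inside each paragraph is an assumption made here for readability.

```python
import re


def split_paragraphs(text):
    """Split text into paragraphs separated by one or more blank lines."""
    # A blank line is a newline, optional whitespace, then another newline.
    blocks = re.split(r"\n\s*\n", text.strip())
    # Assumption: collapse internal whitespace so each paragraph is one line.
    return [" ".join(block.split()) for block in blocks if block.strip()]


text = """A colleague of mine told me that the document 12356897 contains
very important information, so I want to get it.

Where can I locate a specific file? I was not able to find what
I was looking for.
"""

paragraphs = split_paragraphs(text)
print(len(paragraphs))
```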

Sampler: metadata

Description: Extracts samples from a list of documents. By default, the extended attributes of the documents are used as the features, so all the documents in the referenced folder must share the same attribute scheme. With the Automation you can also extract any other data for each document.

Settings:
  • Folder: the folder that contains the documents to process
  • Category: name of the extended attribute that contains the category, optional
  • Features: ordered, comma-separated list of the names of the extended attributes that store the feature values
  • Automation: an automation script used to extract a sample from a source document, accessible via the dictionary key $document
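
To make the Features and Category settings more concrete, here is a minimal Python sketch of turning one document's extended attributes into a sample. Everything in it is hypothetical: the function `build_sample`, the attribute names, and in particular the choice to append the category as the last element (mirroring the CSV example) are assumptions, not documented LogicalDOC behavior.

```python
def build_sample(attributes, features, category=None):
    """Build one sample row from a document's extended attributes.

    attributes: dict of attribute name -> value (stand-in for a document)
    features:   ordered, comma-separated attribute names
    category:   optional name of the attribute holding the category
    """
    names = [name.strip() for name in features.split(",") if name.strip()]
    # Keep the feature order exactly as listed in the setting.
    row = [str(attributes[name]) for name in names]
    if category:
        # Assumption: the category goes last, as in the CSV example.
        row.append(str(attributes[category]))
    return row


# Hypothetical attribute names, used only for illustration.
doc_attributes = {
    "sepalLength": 5.1,
    "sepalWidth": 3.5,
    "species": "Setosa",
}

sample = build_sample(doc_attributes, "sepalLength, sepalWidth", category="species")
print(sample)
```

A document that lacks one of the listed attributes raises a `KeyError` in this sketch, which reflects the requirement that all documents in the folder share the same attribute scheme.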

Sampler: chain

Description: Collects the samples extracted by a collection of other samplers.

Settings:
  • Chain: ordered list of samplers
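
Conceptually, a chain concatenates the output of its samplers in the configured order. The Python sketch below illustrates the idea under a simplifying assumption: each sampler is modeled as a zero-argument function returning an iterable of samples. The names `chain_samplers`, `csv_like`, and `metadata_like` are hypothetical.

```python
from itertools import chain


def chain_samplers(samplers):
    """Yield every sample from each sampler in turn, preserving order."""
    return chain.from_iterable(sampler() for sampler in samplers)


def csv_like():
    # Stand-in for a sampler that reads rows from a CSV document.
    return [["5.1", "3.5", "Setosa"], ["7", "3.2", "Versicolor"]]


def metadata_like():
    # Stand-in for a sampler that reads document attributes.
    return [["6.2", "3.4", "Virginica"]]


samples = list(chain_samplers([csv_like, metadata_like]))
print(len(samples))
```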

Snippets

Include Drop Table:

AUTOMATION CONTEXT: VALIDATION
Variable | Java Class | Description
object | ExtensibleObject | the object under validation, for instance a Document or a Folder
event | History | the current event
errors | | map of strings. Key is the attribute name, Value is the error description
attributeName | | name of the current attribute being validated (only available in attribute validator)
attribute | Attribute | the current attribute being validated (only available in attribute validator)
value | | value of the current attribute being validated (only available in attribute validator)
error | ValidationError | keeps the error description of the current attribute being validated (only available in attribute validator)

 

Put an automation dictionary:

Dictionary available for the Automation in this context

AUTOMATION CONTEXT: EMAIL IMPORT
Variable | Java Class | Description
email | EMail | the email being imported
account | EmailAccount | the account from which the email is being imported
message | | the message representation as returned by the mail server
document | Document | the document that will be used to save the email in the repository
documents | Document | list of documents that will be used to save the email and its attachments in the repository
imports | Pair<EMail, List<Document>> | list of imported messages (Pair email - list of imported documents) during the crawling of the current email account
errors | Pair<String, Exception> | list of errors (Pair error message - exception) that occurred during the crawling of the current email account

Read the Automation manual for more information.

 

Pre-formatted text:

$ sudo apt-get install ghostscript
$ sudo apt-get install imagemagick
$ whereis convert
$ convert -version
 
Minimum Hardware Requirements

Server:
  • 2.4 GHz 32-bit (x86) or 64-bit (x64) dual core processor
  • 6 GB of system memory
  • 1 GB of storage for the application data and repository storage as needed

Workstations:
  • 2.0 GHz 32-bit (x86) or 64-bit (x64) processor
  • 1 GB of system memory
  • 50 MB storage space
  • Broadband connection to server

Minimum Software Requirements

LogicalDOC Server Installer Package:
  • Red Hat Enterprise Linux 7 (32-bit and 64-bit)
  • SUSE Linux Enterprise Server 10 (32-bit and 64-bit)
  • CentOS Linux 6 (32-bit and 64-bit)
  • Ubuntu 14 (32-bit and 64-bit)
  • Microsoft Windows 2008 Server (32-bit and 64-bit)
  • Microsoft Windows 7 (32-bit and 64-bit)
  • MAC OS X Lion (OS X 10.8)
  • Java JDK 8

Supported Browsers:
  • Firefox 40
  • Internet Explorer 11
  • Edge 25
  • Google Chrome 40
  • Safari 6
 

Clear the browser cache

After every version upgrade, you must clear the browser cache so that the browser reloads the latest version of the graphical user interface.

Internet Explorer
  1. Click Tools in the Internet Explorer menu bar
  2. Click Internet Options
  3. Click the General tab
  4. Select Temporary Internet files
  5. Click Delete.

 

Firefox
  1. At the top of the Firefox window, click the Firefox button and then click Options
  2. Select the Advanced panel.
  3. Click the Network tab.
  4. In the Cached Web Content section, click Clear Now.
  5. Click OK to close the Options window
Chrome
  1. Click the wrench icon in the browser toolbar
  2. Select Tools
  3. Select Clear browsing data
  4. In the dialog that appears, select all the checkboxes
  5. Use the menu at the top to select the beginning of time, so that everything is deleted.
  6. Click Clear browsing data
Safari
  1. In the Safari menu bar, click the Safari option
  2. Select the Preferences link
  3. In the Preferences window, click the Advanced tab
  4. At the bottom of the window, select the Show Develop menu in menu bar checkbox
  5. Return to the Safari menu bar and click the Develop option
  6. Select the Empty Caches link
Opera
  1. Click Edit in the menu bar
  2. Click Preferences in the File menu
  3. Click the History and Cache menu
  4. Click Cache
  5. Click OK to close the preferences menu

Installation Guide

LogicalDOC lets you centralize all company information in a single point of access while ensuring data security, so that only authorized users can access specific data. At the same time, it provides a complete auditing service. LogicalDOC is a Web 2.0 application that works with Internet Explorer, Firefox, Safari, and Chrome. In addition, periodic backups prevent loss of information.

However, one of the most important features to mention is the indexing of the most common file types: text, Office, Office 2007, OpenOffice/LibreOffice, PDF, HTML, XML, JPEG, etc. Its powerful search engine indexes all documents, making it easy to find any information.