Skip to main content

Artificial Intelligence

Artificial Intelligence or simply AI, could be defined as a technology that enables machines to simulate human learning, comprehension, problem-solving, decision-making, creativity and autonomy.

Beyond such introduction, there is no single, simple definition of Artificial Intelligence because AI tools are capable to performs tasks under varying and unpredictable circumstances without significant human oversight and can learn from experience and improve performance when exposed to data sets.

LogicalDOC contains a general purpose AI engine with which you can solve problems even not strictly related to document management, but with the advantage of being able to benefit from all the potential of a Document Management System to manage large volumes of data necessary for training.

Models

AI models are programs that implement an algorithm designed to solve a problem in the same way it would do a human brain, you can also look at them as artificial brains enabling systems to learn from data and perform tasks like analysis, prediction, and content generation.

At the time of writing, LogicalDOC supports this set of models:

  • Neural Network: useful to predict the category or nature of an object on the basis of input data
  • Classifier: uses Natural Language Processing(NLP) to catalog a naturally written text
  • Tokens Detector: uses Natural Language Processing(NLP) to extract tokens from a naturally written text

Samplers

Models cannot do anything without having been trained: like children, they must learn from experience in order to 'understand' how to solve a given problem.

In AI, this experience is built through a process called training that basically presents to the model a huge dataset of examples. The size and quality of the dataset impacts the model's ability to identify patterns in the data and therefore to understand the problem.

Samplers are those objects responsible for retrieving data used in training the models.

Samplers

A sampler is an object used to retrieve and prepare a dataset for the training of a model.

You handle the samplers in Administration > Artificial Intelligence > Models > Samplers

TITLE XXXX
 

You can count on different types of sampler with different settings:

SamplerDescriptionSettings
csv

Reads the contents of a CSV file extracting all the rows as string arrays.
Expected format of each resource is this one:

5.1,3.5,1.4,.2,"Setosa"
7,3.2,4.7,1.4,"Versicolor"
6.2,3.4,5.4,2.3,"Virginica"

This example will produce three rows of 5 elements each:

5.1, 3.5, 1.4, .2, Setosa
7, 3.2, 4.7, 1.4, Versicolor
6.2, 3.4, 5.4, 2.3, Virginica
  • Delimiter: the character used as fields delimiter
  • Quote: the character used to enclose the value of a field
  • Document: the CSV document that contains the data
paragraph

Extracts the paragraphs, interpreted as blocks of text separated by blank lines.
Expected format of each resource is this one:

A colleague of mine told me that the document 12356897 contains very important information, so I want to get it. Understood, but are you registered as LogicalDOC's user? If you are a user, just access the interface and then execute a search by document id = 12356897.

Where can I locate a specific file? I was not able to find what I was looking for. Ok, just enter LogicalDOC and search for document with ID -96668429, it is very easy. Sure! Easy and quick, many thanks for your hint.

The example above will produce two paragraphs.

  • Document: the text document that contains the data
metadataExtract samples from a list of documents. By default the extended attributes of the documents are considered as the features, and so all the documents in the referenced folder must share the same attributes scheme. With the Automation you may also extract whatever data for each document.
  • Folder: the folder that contains the documents to process
  • Category: name of the extended attribute that contains the category, optional
  • Features: ordered comma-separated list name of extended attributes used to store the feature values
  • Automation: an automation script used to extract a sample from a source document accessible via the dictionary key $document
chainCollects the samples extracted by a collection of other samplers
  • Chain: ordered list of samplers

Snippets

link to a menu

 

Warning

Body

 

Info

Body

Info

Body

download 

 

Include Drop Table:

AUTOMATION CONTEXT: VALIDATION
VariableJava ClassDescription
objectExtensibleObjectthe object under validation, for instance a Document or a Folder
eventHistorythe current event
errorsmap of strings. Key is the attribute name, Value is the error description
attributeNamename of the current attribute being validated (only available in attribute validator)
attributeAttributethe current attribute being validated (only available in attribute validator)
valuevalue of the current attribute being validated (only available in attribute validator)
errorValidationErrorkeeps the error description of the current attribute being validated (only available in attribute validator)

 

Put an automation dictionary:

Dictionary available for the Automation in this context

AUTOMATION CONTEXT: EMAIL IMPORT
VariableJava ClassDescription
email

EMail

the email being imported
account

EmailAccount

the account from which the email is being imported
messagethe message representation as returned by the mail server
document

Document

the document that will be used to save the email in the repository
documents

Document

list of documents that will be used to save the email and it's attachments in the repository
importsPair<EMail, List<Document>>

list of imported messages (Pair email - list of imported documents) during the crawling of the current email account

errors

Pair<String, Exception>

list of errors (Pair error nessage - exception) happened during the crawling of the current email account

Read the Automation manual for more information.

 

Include a video from YouTube:

 

Put an image as lightbox:

TITLE XXXX
 

TITLE XXXX
 

 

Pre-formatted text:

$ sudo apt-get install ghostscript
$ sudo apt-get install imagemagick
$ whereis convert
$ convert –version
 
Minimum Hardware Requirements
Server 2.4 GHz 32-bit (x86) or 64-bit (x64) dual core processor
6 GB of system memory
1 GB of storage for the application data and repository storage as needed.
Workstations 2.0 GHz 32-bit (x86) or 64-bit (x64) processor
1 GB of system memory
50 MB storage space
Broadband connection to server
Minimum Software Requirements
LogicalDOC Server Installer Package Red Hat Enterprise Linux 7 (32-bit and 64-bit)
SUSE Linux Enterprise Server 10 (32-bit and 64-bit)
CentOS Linux 6 (32-bit and 64-bit)
Ubuntu 14 (32-bit and 64-bit)
Microsoft Windows 2008 Server (32-bit and 64-bit)
Microsoft Windows 7 (32-bit and 64-bit)
MAC OS X Lion (OS X 10.8)
Java JDK 8
Supported Browsers Firefox 40
Internet Explorer 11
Edge 25
Google Chrome 40
Safari 6
 

Borrar la Cache del navegador

Each time your installation is upgraded, you have to clear the Web browser cache in order to force the browser to load the latest version of the Graphical User Interface.

Internet Explorer
  1. Click Tools from the Internet Explorer menubar
  2. Click Internet Options
  3. Click the General tab
  4. Check Temporary Internet files item
  5. Click on Delete.
Firefox
  1. At the top of the Firefox window, click on the Firefox button and then click Options
  2. Select the Advanced panel.
  3. Click on the Network tab.
  4. In the Cached Web Content section, click Clear Now.
  5. Click OK to close the Options window
Chrome
  1. Click the wrench icon on browser tool bar
  2. Select Tools
  3. Select Clear browsing data
  4. In the dialog that appears, select the checkboxes for the types of information that you want to remove
  5. Use the menu at the top to select the amount of data that you want to delete. Select beginning of time to delete everything.
  6. Click Clear browsing data
Safari
  1. In the Safari menu bar, click the Safari option
  2. Select the Preferences link
  3. In the Preferences window, click the Advanced tab
  4. At the bottom of the window, select the Show Develop menu in menu bar checkbox
  5. Return to the Safari menu bar and click the Develop option
  6. Select the Empty Caches link
Opera
  1. Click Edit from the Ope ra menubar
  2. Click Preferences from the File menu
  3. Click the History and Cache menu
  4. Click Cache
  5. Click ok to close the Preferences menu

Guía de Instalación

LogicalDOC permite centralizar toda la información de la empresa en un único punto de acceso, garantizando la seguridad de los datos. De esta manera, sólo los usuarios autorizados pueden tener acceso a datos específicos. Al mismo tiempo, ofrece un servicio de auditoría completa. LogicalDOC es una aplicación Web 2.0 que funciona con Internet Explorer, Firefox, Safari y Chrome. Además, las copias de seguridad periódicas evitan pérdidas de información.

Sin embargo, una de las funciones más relevantes de mencionar es la indexación de los tipos más comunes de archivos: texto, Office, OpenOffice/LibreOffice, PDF, HTML, XML, JPEG, etc.
Su potente motor de búsqueda indexa todos los documentos por lo que es fácil encontrar cualquier información.