CBA Spotlight Technical Information Guide

Spotlight Technical Information Guide Date: October 26, 2016 Co-2-Co SPOTLIGHT Technical Information Guide © 2017 CO-...

0 downloads 152 Views 280KB Size
Spotlight Technical Information Guide

Date:

October 26, 2016

Co-2-Co SPOTLIGHT Technical Information Guide © 2017 CO-2-CO Inc. – All Rights Reserved

10/26/2016 Page: 1

Introduction Co-2-Co Spotlight makes it much easier to search for information in legacy files. These files may originate from scanned hardcopy or be stored on existing company file servers. Advanced linguistic, taxonomy and pattern matching technologies are leveraged to extract useful structured information from the content of unstructured files. The organizations specific business taxonomy is used to evaluate the importance and relevancy of every file. The resultant structured metadata for each file can be used to migrate to a new IT environment or simply provide a powerful search capability to the files in their original location. The Spotlight system can be deployed as a cloud or on premise solution. The cloud solution has two offerings, standard and enterprise. The standard offering is offered on a shared, multi-tenanted infrastructure whereas enterprise is private and via dedicated barebone servers.

Technical Architecture

When running the Cloud version, the only Spotlight software component that needs to be installed on a customer system is the Spotlight Scanner. The scanner is software that scans all the files and then connects over the network to the Spotlight Server to process files one by one. One or more Spotlight Processor components perform the analysis on the contents of the customer files and store the resultant information in a database. None of the customer’s files are retained on the Spotlight server, just index information supporting search. End users can use the Spotlight web application to search using the search database in the cloud and that database references files residing within the customers own file systems. The Enterprise option of the cloud and on premise versions supports integration with the customer Active Directory for authentication and Single Sign On capability. When using the On-Premise version, the Spotlight Server, Processors and Database are installed on computers within the customer’s IT environment. Spotlight can process a very broad range of file types, from scanned hardcopy, photographs, office document formats, PDF, email messages and CAD drawings.

Co-2-Co SPOTLIGHT Technical Information Guide © 2017 CO-2-CO Inc. – All Rights Reserved

10/26/2016 Page: 2

Technical Requirements STANDARD AND ENTERPRISE CLOUD DEPLOYMENT The only component to be installed for Cloud deployment is the Spotlight Scanner. This can run on a Windows workstation or a server. Only one instance of a scanner can run on a single workstation or server but multiple scanners across different workstations and servers is supported. The pre-requisites are Windows 7 and above operating system on a Windows workstation and Windows Server 2008 and above on a Windows server. The Scanner is a Java program and requires a Java runtime environment (JVM) at version 8 and above is present on the workstation or server. The scanner maintains a local cache database of information during its processing – for each million files scanned allow 50GB of storage for this cache.

STANDARD AND ENTERPRISE ON PREMISE DEPLOYMENT The information above for the Spotlight Scanner is the same for the On-Premise deployment. The Spotlight Server will need to be installed on Windows Server 2012 and above. We recommend a virtual or real Intel Xeon server (3Ghz and above) with at least 4 cores and minimum of 32Gb DDR3 or DDR4 ram. We recommend SSD storage (or faster) with capacity as below:

Quantity of Files scanned

SSD Storage Requirement (GB)

Up to 250k

200

Up to 1m

500

Up to 5m

1,000

The Spotlight Server is a Java program and requires a Java run-time environment at version 8 and above is present on the server. The Spotlight Server requires Apache Tomcat v8, Apache SOLR v6 and MySQL v5.7 as pre-requisites. The Spotlight Processor(s) will need to be installed on Windows Server 2012 and above. We recommend a virtual or real Intel Xeon server (3Ghz and above) with at least 24 cores and minimum of 128Gb DDR3 or DDR4 ram. A high degree of parallel processing is required to support the efficient processing of the files. Increased throughput can be achieved by adding further Spotlight Processor servers to the overall configuration. The Spotlight Processor is a Java program and requires a Java run-time environment at version 8 and above is present on the server. The Spotlight Server requires Apache Tomcat v8 as a pre-requisite. For a small test configuration (up to 10k files) the Spotlight Server and Processor can be resident on the same server.

Co-2-Co SPOTLIGHT Technical Information Guide © 2017 CO-2-CO Inc. – All Rights Reserved

10/26/2016 Page: 3