Tab navigation
- Overview
- Objectives- selected tab,
- Test preparation
The test contains 16 sections totalling 71 multiple-choice questions. The percentages after each section title reflect the approximate distribution of the total question set across the sections.
Section 1 - Architecture, Terminology and Concepts (1%)
- Information data flow
- ICA System components (crawler, document processors, indexes, search run time, text miner application, admin console, search application)
- Common ICA terms (collection, document processing, annotators, dictionary, crawler, facets, correlation, deviations, frequency, UIMA, LanguageWare)
Section 2 - Collection Design, Planning and Preparation (11%)
- Collection design, organizing information into collections
- Logical Document Model, search fields, and facets
- Data Sources (structured and unstructured) and search field mapping
- Planning and field mapping for dictionaries, categorization, and additional annotators
- Planning for integration of IBM Classification Module
- Date field formats and usage
- When ETL is needed
Section 3 - Installation and Initial Configuration (4%)
- System planning and sizing
- Adding search or document processing nodes
- Deployment on WebSphere
- All in one (basic) and complex installation (multi-node systems concepts)
- Pre-reqs, Supported operating systems
- Upgrades
- Agent Server installation
- Post Installation
- High Availability & Disaster Recovery
Section 4 - Crawling and Importing (11%)
- Different types of crawlers (web, file sys, collaboration, database, content repos, portal, etc..)
- Obtaining fielded metadata from crawled documents
- Importing CSV files
- Crawler scheduling configuration
- XML and HTML Mappings
- Crawler plugin capabilities
- Field filters
Section 5 - Document Processing and Parsing (10%)
- Document Text Extraction
- Language Identification
- Named Entity Extractions
- Parts of Speech
- UIMA
- Terms of Interest (predicate and entity)
- Document Clusters (proposals) and results
- Creating and configuring dictionaries and synonyms (built-in)
- Classification - URI and rule-based
- Duplicate Detection
- Pipeline configuration (how to hook up ICM conf.)
- Annotator configuration (using LRW and non LRW)
- Dictionaries (command line dictionaries - synonyms, stop words, boost word)
Section 6 - Security (6%)
- Administrator access control
- Collection access control
- User Authentication
- LDAP and single-sign-on
- Configure user roles
- Database connections
- Source repository connections
- Search security (document level)
Section 7 - Text Analytics Customizing (10%)
- Dictionaries
- Parsing, break and character rules
- Annotations and features
- Integration with ICA
- Collaborating/Sharing text analytics models
- Using Languageware Resource Workbench (LRW)
Section 8 - Exporting Data (4%)
- Exporting crawled data
- Exporting after text analytics pipeline
- Exporting search results
- Exporting to a relational database and the file system
- Exporting simple CSV
- Deep Inspection
- Export Plugin Capabilities
- Exporting to ICM
Section 9 - Scalability and Performance (4%)
- Adding document processors
- Adding search servers
- Performance considerations
Section 10 - Search Application (6%)
- Search syntax
- Major features and functions (e.g., query type ahead, synonyms, did you mean, site collapse, facets)
- Search customizer
- Customization via preferences
- Ranking optimization (results ranking)
Section 11 - Text Miner Application (7%)
- Text miner views
- How to identify trends, patterns
- Query navigation (saving, forward/backward, query tree)
- Query builder
- Analytics customizer
- Dashboarding
- Date fields
- Rule based categories
- Flags
- Preferences (duplicate detection)
Section 12 - Rest API (3%)
- Real time NLP API
- REST API (Search and Admin)
Section 13 - Integration with External Systems (3%)
- Portal integration, Cognos, Netezza, ICM, SPSS
- Common use cases (BPM, content assessment …)
Section 14 - Plugin development (3%)
- Crawler plugin
- Export plugin
- Text Miner plugin
- Post filtering search results plugin
Section 15 - Troubleshooting (6%)
- Dropped Documents
- Failed Crawling
- System logs and monitoring (Enabling Logs and Log locations)
- Monitoring System
- Backup and Restore
Section 16 - Administration (11%)
- ES Admin Commands
- System administration
- Collection creation administration
- Crawler administration
- Parser administration
- Language and code page support
- Index administration for search collections
- Search server administration
- Search applications
