Test 000-583: IBM Content Analytics and Search V2.2

The test contains 16 sections totalling 71 multiple-choice questions. The percentage after each section title reflects the approximate share of the total question set drawn from that section.

Section 1 - Architecture, Terminology and Concepts (1%)

  1. Information data flow
  2. ICA System components (crawler, document processors, indexes, search run time, text miner application, admin console, search application)
  3. Common ICA terms (collection, document processing, annotators, dictionary, crawler, facets, correlation, deviations, frequency, UIMA, LanguageWare)


Section 2 - Collection Design, Planning and Preparation (11%)

  1. Collection design, organizing information into collections
  2. Logical Document Model, search fields, and facets
  3. Data Sources (structured and unstructured) and search field mapping
  4. Planning and field mapping for dictionaries, categorization, and additional annotators
  5. Planning for integration of IBM Classification Module
  6. Date field formats and usage
  7. When ETL is needed


Section 3 - Installation and Initial Configuration (4%)

  1. System planning and sizing
  2. Adding search or document processing nodes
  3. Deployment on WebSphere
  4. All-in-one (basic) and complex installations (multi-node system concepts)
  5. Prerequisites and supported operating systems
  6. Upgrades
  7. Agent Server installation
  8. Post Installation
  9. High Availability & Disaster Recovery


Section 4 - Crawling and Importing (11%)

  1. Different types of crawlers (web, file system, collaboration, database, content repositories, portal, etc.)
  2. Obtaining fielded metadata from crawled documents
  3. Importing CSV files
  4. Crawler scheduling configuration
  5. XML and HTML Mappings
  6. Crawler plugin capabilities
  7. Field filters


Section 5 - Document Processing and Parsing (10%)

  1. Document Text Extraction
  2. Language Identification
  3. Named Entity Extractions
  4. Parts of Speech
  5. UIMA
  6. Terms of Interest (predicate and entity)
  7. Document Clusters (proposals) and results
  8. Creating and configuring dictionaries and synonyms (built-in)
  9. Classification - URI and rule-based
  10. Duplicate Detection
  11. Pipeline configuration (how to hook up the ICM configuration)
  12. Annotator configuration (using LRW and non-LRW)
  13. Dictionaries (command line dictionaries - synonyms, stop words, boost words)


Section 6 - Security (6%)

  1. Administrator access control
  2. Collection access control
  3. User Authentication
  4. LDAP and single-sign-on
  5. Configure user roles
  6. Database connections
  7. Source repository connections
  8. Search security (document level)


Section 7 - Text Analytics Customizing (10%)

  1. Dictionaries
  2. Parsing, break and character rules
  3. Annotations and features
  4. Integration with ICA
  5. Collaborating/Sharing text analytics models
  6. Using LanguageWare Resource Workbench (LRW)


Section 8 - Exporting Data (4%)

  1. Exporting crawled data
  2. Exporting after text analytics pipeline
  3. Exporting search results
  4. Exporting to a relational database and the file system
  5. Exporting simple CSV
  6. Deep Inspection
  7. Export Plugin Capabilities
  8. Exporting to ICM


Section 9 - Scalability and Performance (4%)

  1. Adding document processors
  2. Adding search servers
  3. Performance considerations


Section 10 - Search Application (6%)

  1. Search syntax
  2. Major features and functions (e.g., query type ahead, synonyms, did you mean, site collapse, facets)
  3. Search customizer
  4. Customization via preferences
  5. Ranking optimization (results ranking)


Section 11 - Text Miner Application (7%)

  1. Text miner views
  2. How to identify trends and patterns
  3. Query navigation (saving, forward/backward, query tree)
  4. Query builder
  5. Analytics customizer
  6. Dashboarding
  7. Date fields
  8. Rule based categories
  9. Flags
  10. Preferences (duplicate detection)


Section 12 - REST API (3%)

  1. Real time NLP API
  2. REST API (Search and Admin)


Section 13 - Integration with External Systems (3%)

  1. Portal integration, Cognos, Netezza, ICM, SPSS
  2. Common use cases (BPM, content assessment …)


Section 14 - Plugin development (3%)

  1. Crawler plugin
  2. Export plugin
  3. Text Miner plugin
  4. Post filtering search results plugin


Section 15 - Troubleshooting (6%)

  1. Dropped Documents
  2. Failed Crawling
  3. System logs and monitoring (Enabling Logs and Log locations)
  4. Monitoring System
  5. Backup and Restore


Section 16 - Administration (11%)

  1. ES Admin Commands
  2. System administration
  3. Collection creation administration
  4. Crawler administration
  5. Parser administration
  6. Language and code page support
  7. Index administration for search collections
  8. Search server administration
  9. Search applications


Register for a test

Register for an IBM Certification test at Prometric and take a step into your future.