ISDS Ch 5

Only available on StudyMode
  • Published : November 28, 2012
Business Intelligence, 2e (Turban/Sharda/Delen/King)
Chapter 5 Text and Web Mining

1) DARPA and MITRE teamed up to develop capabilities to automatically filter text-based information sources to generate actionable information in a timely manner. Answer: TRUE
Diff: 2 Page Ref: 190

2) A vast majority of business data is captured and stored in text documents that are structured. Answer: FALSE
Diff: 2 Page Ref: 192

3) Text mining is important to competitive advantage because knowledge is power, and knowledge is derived from text data sources. Answer: TRUE
Diff: 2 Page Ref: 192

4) The purpose and processes of text mining are different from those of data mining because with text mining the inputs to the process are data files such as Word documents, PDF files, text excerpts, and XML files. Answer: FALSE

Diff: 3 Page Ref: 192

5) The benefits of text mining are greatest in areas where very large amounts of textual data are being generated, such as law, academic research, finance, and medicine. Answer: TRUE
Diff: 2 Page Ref: 192

6) Unstructured data has a predetermined format. It is usually organized into records as categorical, ordinal, and continuous variables and stored in databases. Answer: FALSE
Diff: 2 Page Ref: 193

7) Stemming is the process of reducing inflected words to their base or root form. Answer: TRUE
Diff: 1 Page Ref: 193
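
Illustration for item 7: a minimal sketch of stemming, assuming a toy suffix-stripping rule. The function name naive_stem and the suffix list are hypothetical and chosen only for this example; production stemmers such as the Porter algorithm apply far more rules.

```python
# Toy suffix-stripping stemmer. This is an illustrative assumption,
# NOT the Porter algorithm or any method described in the textbook.
SUFFIXES = ("ing", "ed", "es", "s")  # hypothetical suffix list, checked in order


def naive_stem(word: str) -> str:
    """Strip the first matching suffix, keeping at least three characters of stem."""
    lower = word.lower()
    for suffix in SUFFIXES:
        if lower.endswith(suffix) and len(lower) - len(suffix) >= 3:
            return lower[: -len(suffix)]
    return lower


# Inflected forms of the same word reduce to one root form.
words = ["connected", "connecting", "connects", "connect"]
stems = [naive_stem(w) for w in words]
print(stems)
```

Each of the four inflected forms maps to the same root, which is the behavior the statement describes.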

8) Stop words, such as a, am, the, and was, are words that are filtered out prior to or after processing of natural language data. Answer: TRUE
Diff: 2 Page Ref: 193
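
Illustration for item 8: a minimal sketch of stop-word filtering, assuming whitespace tokenization and a stop list containing only the four words named in the statement. The names STOP_WORDS and remove_stop_words are hypothetical; real stop lists are much longer.

```python
# Stop list limited to the four examples given in item 8 (an assumption;
# practical stop lists contain many more words).
STOP_WORDS = {"a", "am", "the", "was"}


def remove_stop_words(text: str) -> list[str]:
    """Lowercase the text, split on whitespace, and drop stop words."""
    tokens = text.lower().split()
    return [token for token in tokens if token not in STOP_WORDS]


filtered = remove_stop_words("The report was a summary of the call")
print(filtered)
```

Here the filter runs before any further processing; the statement notes it may also be applied afterward.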

9) The goal of natural language processing (NLP) is syntax-driven text manipulation. Answer: FALSE
Diff: 2 Page Ref: 196

10) Two advantages associated with the implementation of NLP are word sense disambiguation and syntactic ambiguity. Answer: FALSE
Diff: 2 Page Ref: 196

11) By applying a learning algorithm to parsed text, researchers from Stanford University's NLP lab have developed methods that can automatically identify the concepts and relationships between those concepts in the text. Answer: TRUE

Diff: 2 Page Ref: 197

12) Text mining can be used to increase cross-selling and up-selling by analyzing the unstructured data generated by call centers. Answer: TRUE
Diff: 1 Page Ref: 200

13) Compared to polygraphs for deception-detection, text-based deception detection has the advantages of being nonintrusive and widely applicable to textual data and transcriptions of voice recordings. Answer: TRUE

Diff: 2 Page Ref: 201

14) The main purpose of establishing the corpus is to collect all of the documents related to the context being studied. Answer: TRUE
Diff: 2 Page Ref: 207

15) The main categories of knowledge extraction methods are recall, search, and signaling. Answer: FALSE
Diff: 2 Page Ref: 210

16) Web pages consisting of unstructured textual data coded in HTML and logs of visitors' interactions provide rich data that can easily provide effective and efficient knowledge discovery. Answer: FALSE

Diff: 3 Page Ref: 217

17) Web crawlers are Web content mining tools that are used to read through the content of a Web site automatically. Answer: FALSE
Diff: 1 Page Ref: 218

18) Amazon.com leverages Web usage history dynamically and recognizes the user by reading a cookie written by a Web site on the visitor's computer. Answer: TRUE
Diff: 1 Page Ref: 221

19) The quality of search results is impossible to measure accurately using strictly quantitative measures such as click-through rate, abandonment, and search frequency. Additional quantitative and qualitative measures are required. Answer: TRUE

Diff: 2 Page Ref: 222

20) Customer experience management applications gather and report direct feedback from site visitors by benchmarking against other sites and offline channels, and by supporting predictive modeling of future visitor behavior. Answer: FALSE

Diff: 3 Page Ref: 224

21) A vast majority of business data are stored in text documents...