Choose any of the topics and answer the following: Topic 1: Introduction into data mining concepts. We focus on the importance of data algorithms and how different methods can derive different result


Choose any of the topics and answer the following:Topic 1: Introduction into data mining concepts.  We focus on the importance of data algorithms and how different methods can derive different results. Objectives: 

  1. Define the importance of understanding the differences in different data algorithms and the output variance.
  2. Explain how different output can occur when managing different data algorithms.
  3. Comprehend the various motivating challenges with data mining.
  4. Understand how data mining integrates with the various components of statistics, AL, ML, and Pattern Recognition.
  5. Explain the difference between predictive and descriptive tasks and the importance of each.

Topic 2: A use case on traditional data collection methods and the downfalls.  We also discuss data attributes and classification this week. Objectives: 

  1. Comprehend the traditional methods of data collection and the challenges of traditional methods compared to automated methods.
  2. Discuss the concepts of optimization and performance measurement in a real-world example.
  3. Understand the key components of attributes including the different types and the importance of each.
  4. Explain the difference between discrete and continuous data.
  5. Compare the pitfalls and benefits of model selection and evaluation.
  6. Explain the concepts in data classification.

Topic 3: Various types of classifiers used in data mining.  We also utilize a real-world example and discuss how opinion mining is used in information retrieval and is used with NLP techniques. Objectives: 

  1. Define the various types of classifiers.
  2. Understand the key components to logic regression.
  3. Compare and contrast nearest neighbor and naïve Bayes classifiers.
  4. Discuss a real-world example on opinion mining and how it is used in information retrieval.
  5. Explain the various components and techniques of opinion mining and the importance to transforming an organizations NLP framework.

Answer the following:

1. Define the concept.

2. Note its importance to data science.

3. Discuss corresponding concepts that are of importance to the selected concept.

4. Note a project where this concept would be used.

The paper should be between 2-3 pages and formatted using APA 7 format. Two peer-reviewed sources should be utilized to connect your thoughts to current published works.