Section-1: Data Science - The "What"
Chapter-1: IntroductionFirst chapter will set the basic foundation of the subject for students. Like many other books, this introductory level chapter will comprise of the basic concepts. Introduction of the following concepts will be discussed:* Data Science* Importance of data science* Applications of data science* Data Driven Decision Making* Data analysisChapter-2: Widely used techniques in data scienceThis chapter will discuss the concepts required for one to start working on data analysis. Chapter will comprise of the concepts that student should know before performing any task on data analysis and some of the tasks that can be performed as part of data analysis. Following concepts will be discussed.* Supervised vs Unsupervised data* Data understanding* Data preparation* Modeling* Overfitting* Random sampling* Cross Validation* Feature selection* Outlier detection* Rule extractionSection-2: Data science: The "How"
Chapter-3: Statistical InferenceEvery part of data analysis involves statistics and statistical inference to properly utilize data and perform decision making. This chapter will provide statistical concepts to support the data analysis tasks performed by students for decision making with real life data. Following topics will be discussed:* Probability theory* Transformations and expectations* Common families of distribution* Random variables* Preparation of random samples* Asymptotic evaluations* Regression and regression models
Chapter-4: Supervised Learning In real world, we come across two types of data, supervised and unsupervised. In this chapter, we will discuss the concepts, tools and techniques related to processing of supervised data with examples and decision making out of it. The following concepts will be discussed:* Supervised Learning* Classification and Regression* Generalization, Overfitting and Underfitting* Evaluation models* Supervised learning algorithmsChapter-5: Unsupervised LearningThe unsupervised data forms the other half of the data available in real world applications. Like previous chapter, this chapter will include the concepts, tools and techniques related to unsupervised data with examples. Following contents will be included:* Challenges of unsupervised learning* Processing and scaling* Clustering* Dimensionality reduction, feature extraction and manifold learning* Unsupervised learning algorithmsChapter-6: Natural language processingIn this chapter, we will focus on one particular sort of data that has become extremely common i.e. text data. We will see in this chapter the fundamental principles of natural language processing and will look at one of the common application of NLP that is sentiment analysis. Following contents will be discussed:* Why Text Is Important* Why Text Is Difficult* Representation* Sentiment Analysis* Lexicon-based Approaches for Text MiningSection-3: Data Science - The "Where"
Chapter-7: Customers AnalyticsIn this chapter, we will introduce he use of analytics for understanding customers and predicting their behaviour in different situations. This includes the understanding of loyalty programs, market research, understanding customer lifetime value, predicting churn, and identifying potential defaulters. These are few examples of what will be contained in this chapter.
Chapter-8: Operations AnalyticsIn this chapter, we will prepare our readers to understand and acknowledge the use of data science for improving business operations. For example, we will discuss how analyzing data can help avoid service outages, or at least predict the service outage in order to prepare contingency plans. Analyzing data can also help in identifying redundancies which can be removed in order to significantly reduce operational costs. We will give examples on how various manufacturing and service industries are using real-time sensor data to track their systems wear and tear. This helps them improve their mean time to repair by forecasting breakdown of different components well ahead in time.