Amazon cover image
Image from Amazon.com

Data Science Revealed : With Feature Engineering, Data Visualization, Pipeline Development, and Hyperparameter Tuning / by Tshepo Chris Nokeri.

By: Contributor(s): Material type: TextTextPublication details: United States: Apress, c 2021.Edition: 1st ed. 2021Description: 1 online resource (XX, 252 p. 95 illus.) online resource. 26 cmISBN:
  • 9781484268704
  • 9781484268698
  • 9781484268711
Subject(s): Genre/Form: Additional physical formats: Printed edition:: No title; Printed edition:: No titleDDC classification:
  • 006.312 23 NOK-D 2021 789526
Online resources:
Contents:
Chapter 1: An Introduction to Simple Linear Regression Analysis -- Chapter 2: Advanced Parametric Methods -- Chapter 3: Time Series Analysis -- Chapter 4: High-Quality Time Series Analysis -- Chapter 5: Logistic Regression Analysis -- Chapter 6: Dimension Reduction and Multivariate Analysis Using Linear Discriminant Analysis -- Chapter 7: Finding Hyperplanes Using Support Vectors -- Chapter 8: Classification Using Decision Trees -- Chapter 9: Back to the Classics -- Chapter 10: Cluster Analysis -- Chapter 11: Survival Analysis -- Chapter 12: Neural Networks -- Chapter 13: Machine Learning Using H2O.
In: Springer Nature eBookSummary: Get insight into data science techniques such as data engineering and visualization, statistical modeling, machine learning, and deep learning. This book teaches you how to select variables, optimize hyper parameters, develop pipelines, and train, test, and validate machine and deep learning models. Each chapter includes a set of examples allowing you to understand the concepts, assumptions, and procedures behind each model. The book covers parametric methods or linear models that combat under- or over-fitting using techniques such as Lasso and Ridge. It includes complex regression analysis with time series smoothing, decomposition, and forecasting. It takes a fresh look at non-parametric models for binary classification (logistic regression analysis) and ensemble methods such as decision trees, support vector machines, and naive Bayes. It covers the most popular non-parametric method for time-event data (the Kaplan-Meier estimator). It also covers ways of solving classification problems using artificial neural networks such as restricted Boltzmann machines, multi-layer perceptrons, and deep belief networks. The book discusses unsupervised learning clustering techniques such as the K-means method, agglomerative and Dbscan approaches, and dimension reduction techniques such as Feature Importance, Principal Component Analysis, and Linear Discriminant Analysis. And it introduces driverless artificial intelligence using H2O. After reading this book, you will be able to develop, test, validate, and optimize statistical machine learning and deep learning models, and engineer, visualize, and interpret sets of data. You will: Design, develop, train, and validate machine learning and deep learning models Find optimal hyper parameters for superior model performance Improve model performance using techniques such as dimension reduction and regularization Extract meaningful insights for decision making using data visualization.
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Collection Call number Copy number Status Date due Barcode Item holds
Books Books Faculty of CS & IT Library CS & IT Shelf No. 44 New Arrival Book 006.312 NOK-D 2021 789526 (Browse shelf(Opens below)) C 1 Available 789526
Books Books Faculty of CS & IT Library CS & IT Shelf No. 44 New Arrival Book 006.312 NOK-D 2021 789588 (Browse shelf(Opens below)) C 2 Available 789588
Total holds: 0

Index.

Chapter 1: An Introduction to Simple Linear Regression Analysis -- Chapter 2: Advanced Parametric Methods -- Chapter 3: Time Series Analysis -- Chapter 4: High-Quality Time Series Analysis -- Chapter 5: Logistic Regression Analysis -- Chapter 6: Dimension Reduction and Multivariate Analysis Using Linear Discriminant Analysis -- Chapter 7: Finding Hyperplanes Using Support Vectors -- Chapter 8: Classification Using Decision Trees -- Chapter 9: Back to the Classics -- Chapter 10: Cluster Analysis -- Chapter 11: Survival Analysis -- Chapter 12: Neural Networks -- Chapter 13: Machine Learning Using H2O.

Get insight into data science techniques such as data engineering and visualization, statistical modeling, machine learning, and deep learning. This book teaches you how to select variables, optimize hyper parameters, develop pipelines, and train, test, and validate machine and deep learning models. Each chapter includes a set of examples allowing you to understand the concepts, assumptions, and procedures behind each model. The book covers parametric methods or linear models that combat under- or over-fitting using techniques such as Lasso and Ridge. It includes complex regression analysis with time series smoothing, decomposition, and forecasting. It takes a fresh look at non-parametric models for binary classification (logistic regression analysis) and ensemble methods such as decision trees, support vector machines, and naive Bayes. It covers the most popular non-parametric method for time-event data (the Kaplan-Meier estimator). It also covers ways of solving classification problems using artificial neural networks such as restricted Boltzmann machines, multi-layer perceptrons, and deep belief networks. The book discusses unsupervised learning clustering techniques such as the K-means method, agglomerative and Dbscan approaches, and dimension reduction techniques such as Feature Importance, Principal Component Analysis, and Linear Discriminant Analysis. And it introduces driverless artificial intelligence using H2O. After reading this book, you will be able to develop, test, validate, and optimize statistical machine learning and deep learning models, and engineer, visualize, and interpret sets of data. You will: Design, develop, train, and validate machine learning and deep learning models Find optimal hyper parameters for superior model performance Improve model performance using techniques such as dimension reduction and regularization Extract meaningful insights for decision making using data visualization.

Copyrights 2018© The University of Lahore (UOL) Libraries. All Rights Reserved. Library System Administrator Muhammad Riaz (muhammad.riaz@uol.edu.pk) +92 (0)42 35963421-30 Ext: 1703

Powered by Koha