Data mining and warehousing by S. Prabhu, N. Venatesan

By S. Prabhu, N. Venatesan

Show description

Read or Download Data mining and warehousing PDF

Similar mining books

A Room For The Summer: Adventure, Misadventure, And Seduction In The Mines Of The Coeur D'Alene

In A Room for the summer season, Fritz Wolff takes the reader on a memorable trip into the rough-and-tumble international of hardrock mining, recounting his stories either above and less than flooring as an apprentice engineer through the past due Fifties. In June 1956, on the age of eighteen, Wolff went to paintings for the Bunker Hill corporation in Kellogg, Idaho, within the Coeur d’Alene sector.

Deepwater Petroleum Exploration & Production: A Nontechnical Guide

Textual content overviews the company, engineering, and expertise of deepwater petroleum exploration and creation. presents assurance of all facets of deepwater operations: together with historical historical past; drilling and finishing wells; improvement structures; mounted buildings; floating creation structures; subsea structures; topsides; and pipelines, flowlines, and risers

Fundamentals of Coalbed Methane Reservoir Engineering

Writer John Seidle has written this much-needed advent to a special unconventional fuel source for college students and practising engineers in addition to a simple instruction manual in the event you are enthusiastic about coalbed methane each day and require undemanding, useful solutions within the fast paced company global

Best Practices for Dust Control in Coal Mining

Compiled by way of the U. S. Dept of overall healthiness and Human providers, CDC/NIOSH workplace of Mine safeguard and future health study, this 2010 guide used to be constructed to spot to be had engineering controls which can support the lessen employee publicity to respirable coal and silica airborne dirt and dust. The controls mentioned during this guide diversity from long-utilized controls that experience constructed into criteria to more moderen controls which are nonetheless being optimized.

Additional resources for Data mining and warehousing

Example text

048 Outlook attribute has the highest gain, therefore it is used as the decision attribute in the root node. Since Outlook has three possible values, the root node has three branches (sunny, overcast, rain). ” Since we have used Outlook at the root, we only decide on the remaining three attributes: Humidity, Temperature, or Wind. 970 Outlook Sunny Overcast Humidity High No Rain Yes Wind Strong Normal Yes Weak No Fig. 019 Humidity has the highest gain; therefore, it is used as the decision node.

In the decision tree at each node should be associated the non-goal attribute which is most informative among the attributes not yet considered in the path from the root. Entropy is used to measure how informative is a node. Which attribute is the best classifier? The estimation criterion in the decision tree algorithm is the selection of an attribute to test at each decision node in the tree. The goal is to select the attribute that is most useful for classifying examples. A good quantitative measure of the worth of an attribute is a statistical property called information gain that measures how well a given attribute separates the training examples according to their target classification.

The algorithm consists of five steps. 1. T ← the whole training set. Create a T node. 2. If all examples in T are positive, create a ‘P’ node with T as its parent and stop. 3. If all examples in T are negative, create a ‘N’ node with T as its parent and stop. Data Mining Techniques | 25 4. Select an attribute TN according their and X = vi as the 5. For each Ti do: T X with values v1, v2, …, vN and partition T into subsets T1, T2, …, values on X. , N) with T as their parent label of the branch from T to Ti.

Download PDF sample

Rated 4.14 of 5 – based on 12 votes