Sunday, 19 April 2009

DATA MINING

AIntroduction: very interesting topic was introduced as the presentation work by Mr. Remmy Kaaro together with Mr. Joshua Shendu. The topic was all about knowledge discovery. They explained a lot about how to extract hidden information from different resources. They actually gave a good presentation which was understandable to us.

By definition, data mining is the process of extracting and analyzing large amounts of information in order to find important data from different hidden sources as well as summarizing them into simple information that can be identified easily.

They explained several forms of data mining; Relation data mining, text mining, Audio mining, image data mining, web data mining and video data mining. All these were showing on how to extract the data from many different sources.

There are specifically three stages of data mining which are;Exploration:This stage usually starts with data preparation which involves cleaning data and data transformation.Model building and validation:This stage involves considering various models and choosing the best one based on their predictive performance.Deployment:This is the last stage which organizes and presents the knowledge gained in a way that the customer can use it, usually involves the use of the model selected as best in the previous stage and applying it to new data in order to generate predictions or estimates of the expected outcomes.

There are several advantages of this data mining; here are some of themIn banking: The bank analyzes the information and it can advertise certain products to one group while advertising different products to another group.Marking, researchers and reinforcement.

Regarding of the advantages of data mining still there are some disadvantages which is very bad incase the technology used wrongly. We have:Security issues; It needs the high quality security any leakage of information can cause problems.Misuse of information; if happens the information to leak, then it can be used sometimes in criminal matters and also use the information to harm other people because of their wealth records. Privacy issues; there are some issues which people do not want them to be known as public issues so the information should be kept hidden always.

Conclusion:Data mining is just a better solution and sometimes can act as a cure for certain types of problems in the daily life of human being.

No comments:

Post a Comment