{"id":1057,"date":"2024-01-26T14:56:30","date_gmt":"2024-01-26T14:56:30","guid":{"rendered":"https:\/\/learnlearn.uk\/ibcs\/?page_id=1057"},"modified":"2024-01-26T15:01:10","modified_gmt":"2024-01-26T15:01:10","slug":"data-integrity","status":"publish","type":"page","link":"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/","title":{"rendered":"Data Integrity"},"content":{"rendered":"<div class=\"responsive-tabs\">\n<h2 class=\"tabtitle\">Data Mining<\/h2>\n<div class=\"tabcontent\">\n\n<h3>Data Mining<\/h3>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignright size-medium wp-image-1058\" src=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/data-mining-300x300.webp\" alt=\"\" width=\"300\" height=\"300\" srcset=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/data-mining-300x300.webp 300w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/data-mining-150x150.webp 150w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/data-mining-768x768.webp 768w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/data-mining.webp 1024w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>Data mining is the process of discovering patterns, correlations, and trends by sifting through large amounts of data stored in repositories, using various techniques from machine learning, statistics, and database systems.<\/p>\n<p>It involves the extraction of hidden predictive information from large databases and is a powerful tool that can help companies focus on the most important information in their data warehouses.<\/p>\n\n<\/div><h2 class=\"tabtitle\">Mining Steps<\/h2>\n<div class=\"tabcontent\">\n\n<h3>Data Mining Steps<\/h3>\n<p>1. Data Collection and Preparation<br \/>\nGathering relevant data from various sources and preparing it for analysis. This step includes data cleaning, integration, and transformation.<\/p>\n<p>2. 
Data Exploration and Understanding<br \/>\nUsing descriptive statistics and visualization techniques to better understand the nature of the data, its quality, and the underlying patterns.<\/p>\n<p>3. Model Building and Validation<br \/>\nApplying appropriate algorithms to discover patterns and relationships within the data.<\/p>\n<p>4. Deployment and Interpretation of Results<br \/>\nUsing the patterns and relationships found in the data to make decisions or predictions. The interpretation of these results should align with business objectives and needs.<\/p>\n<p>&nbsp;<\/p>\n\n<\/div><h2 class=\"tabtitle\">Cluster Analysis<\/h2>\n<div class=\"tabcontent\">\n\n<h3>Cluster Analysis<\/h3>\n<div id=\"attachment_1059\" style=\"width: 310px\" class=\"wp-caption alignright\"><a href=\"https:\/\/www.researchgate.net\/figure\/Biplot-table-of-the-cluster-analysis-for-n549-respondents-output-based-on-own-analysis_fig1_371226710\"><img aria-describedby=\"caption-attachment-1059\" decoding=\"async\" loading=\"lazy\" class=\"size-medium wp-image-1059\" src=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/Cluster-Analysis-300x300.jpg\" alt=\"\" width=\"300\" height=\"300\" srcset=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/Cluster-Analysis-300x300.jpg 300w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/Cluster-Analysis-150x150.jpg 150w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/Cluster-Analysis.jpg 320w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><p id=\"caption-attachment-1059\" class=\"wp-caption-text\">Image Source: ResearchGate<\/p><\/div>\n<p>Cluster analysis is a technique used to group sets of objects in such a way that objects in the same group (or cluster) are more similar to each other than to those in other groups.<\/p>\n<p>It&#8217;s widely used in statistical data analysis for various applications, such as pattern recognition, image analysis, and 
bioinformatics.<\/p>\n<p>Clustering does not use pre-labeled classes; instead, it identifies similarities between data points and groups them accordingly.<\/p>\n<p>&nbsp;<\/p>\n\n<\/div><h2 class=\"tabtitle\">Classification Analysis<\/h2>\n<div class=\"tabcontent\">\n\n<h3>Classification Analysis<\/h3>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignright size-medium wp-image-1060\" src=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/spam-not-spam-300x300.webp\" alt=\"\" width=\"300\" height=\"300\" srcset=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/spam-not-spam-300x300.webp 300w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/spam-not-spam-150x150.webp 150w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/spam-not-spam-768x768.webp 768w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/spam-not-spam.webp 1024w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>This technique involves finding a model (or function) that describes and distinguishes data classes or concepts. The model is then used to predict the class of objects whose class label is unknown.<\/p>\n<p>The model is built from training data: a set of training examples whose class labels are already known. 
Classification is common in applications where you need to categorize data into predefined labels, such as spam\/not spam.<\/p>\n<p>&nbsp;<\/p>\n\n<\/div><h2 class=\"tabtitle\">Association Analysis<\/h2>\n<div class=\"tabcontent\">\n\n<h3>Association Analysis<\/h3>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignright size-medium wp-image-1061\" src=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/association-analysis-300x300.webp\" alt=\"\" width=\"300\" height=\"300\" srcset=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/association-analysis-300x300.webp 300w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/association-analysis-150x150.webp 150w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/association-analysis-768x768.webp 768w, https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/association-analysis.webp 1024w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>Association analysis is a rule-based method for discovering interesting relations between variables in large databases. 
It&#8217;s often used in market basket analysis to find relationships between items purchased together.<\/p>\n<p>The classic example is the &#8220;beer and diapers&#8221; scenario, where supermarkets discovered through association rule mining that these two products were often bought together.<\/p>\n\n<\/div><h2 class=\"tabtitle\">Sequential Analysis<\/h2>\n<div class=\"tabcontent\">\n\n<h3>Sequential Pattern Analysis<\/h3>\n<p>Sequential pattern mining is a topic in data mining concerned with finding statistically relevant patterns between data examples where the values are delivered in a sequence.<\/p>\n<p>It&#8217;s used in a variety of contexts, such as analyzing customer purchase behavior, web page visits, scientific experiments, and natural disasters.<\/p>\n\n<\/div><h2 class=\"tabtitle\">Forecasting<\/h2>\n<div class=\"tabcontent\">\n\n<h3>Forecasting<\/h3>\n<p>Forecasting involves using historical data as inputs to make informed estimates or predictions about future events. In the context of data mining, forecasting is often associated with time-series data analysis, used for predicting future trends based on past data.<\/p>\n<p>Common applications include stock market analysis, weather forecasting, and sales forecasting.<\/p>\n<\/div><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Data Mining Data mining is the process of discovering patterns, correlations, and trends by sifting through large amounts of data stored in repositories, using various techniques from machine learning, statistics, and database systems. 
It involves the extraction of hidden predictive information from large databases and is a powerful tool that can help companies focus on&hellip;&nbsp;<a href=\"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/\" class=\"\" rel=\"bookmark\">Read More &raquo;<span class=\"screen-reader-text\">Data Integrity<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"neve_meta_sidebar":"","neve_meta_container":"","neve_meta_enable_content_width":"off","neve_meta_content_width":100,"neve_meta_title_alignment":"","neve_meta_author_avatar":"","neve_post_elements_order":"","neve_meta_disable_header":"","neve_meta_disable_footer":"","neve_meta_disable_title":""},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Data Integrity - IB Computer Science<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data Integrity - IB Computer Science\" \/>\n<meta property=\"og:description\" content=\"Data Mining Data mining is the process of discovering patterns, correlations, and trends by sifting through large amounts of data stored in repositories, using various techniques from machine learning, statistics, and database systems. 
It involves the extraction of hidden predictive information from large databases and is a powerful tool that can help companies focus on&hellip;&nbsp;Read More &raquo;Data Integrity\" \/>\n<meta property=\"og:url\" content=\"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/\" \/>\n<meta property=\"og:site_name\" content=\"IB Computer Science\" \/>\n<meta property=\"article:modified_time\" content=\"2024-01-26T15:01:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/data-mining-300x300.webp\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/\",\"url\":\"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/\",\"name\":\"Data Integrity - IB Computer Science\",\"isPartOf\":{\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/#website\"},\"datePublished\":\"2024-01-26T14:56:30+00:00\",\"dateModified\":\"2024-01-26T15:01:10+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"IB Computer Science\",\"item\":\"https:\/\/learnlearn.uk\/ibcs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data Integrity\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/#website\",\"url\":\"https:\/\/learnlearn.uk\/ibcs\/\",\"name\":\"IB Computer Science\",\"description\":\"- 
learnlearn..uk\",\"publisher\":{\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/learnlearn.uk\/ibcs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/#organization\",\"name\":\"IB Computer Science\",\"url\":\"https:\/\/learnlearn.uk\/ibcs\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2022\/09\/LearnLearnLogowhite-300x41.png\",\"contentUrl\":\"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2022\/09\/LearnLearnLogowhite-300x41.png\",\"width\":300,\"height\":41,\"caption\":\"IB Computer Science\"},\"image\":{\"@id\":\"https:\/\/learnlearn.uk\/ibcs\/#\/schema\/logo\/image\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Data Integrity - IB Computer Science","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/","og_locale":"en_GB","og_type":"article","og_title":"Data Integrity - IB Computer Science","og_description":"Data Mining Data mining is the process of discovering patterns, correlations, and trends by sifting through large amounts of data stored in repositories, using various techniques from machine learning, statistics, and database systems. 
It involves the extraction of hidden predictive information from large databases and is a powerful tool that can help companies focus on&hellip;&nbsp;Read More &raquo;Data Integrity","og_url":"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/","og_site_name":"IB Computer Science","article_modified_time":"2024-01-26T15:01:10+00:00","og_image":[{"url":"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2024\/01\/data-mining-300x300.webp"}],"twitter_card":"summary_large_image","twitter_misc":{"Estimated reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/","url":"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/","name":"Data Integrity - IB Computer Science","isPartOf":{"@id":"https:\/\/learnlearn.uk\/ibcs\/#website"},"datePublished":"2024-01-26T14:56:30+00:00","dateModified":"2024-01-26T15:01:10+00:00","breadcrumb":{"@id":"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/learnlearn.uk\/ibcs\/data-integrity\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/learnlearn.uk\/ibcs\/data-integrity\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"IB Computer Science","item":"https:\/\/learnlearn.uk\/ibcs\/"},{"@type":"ListItem","position":2,"name":"Data Integrity"}]},{"@type":"WebSite","@id":"https:\/\/learnlearn.uk\/ibcs\/#website","url":"https:\/\/learnlearn.uk\/ibcs\/","name":"IB Computer Science","description":"- learnlearn..uk","publisher":{"@id":"https:\/\/learnlearn.uk\/ibcs\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/learnlearn.uk\/ibcs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-GB"},{"@type":"Organization","@id":"https:\/\/learnlearn.uk\/ibcs\/#organization","name":"IB Computer 
Science","url":"https:\/\/learnlearn.uk\/ibcs\/","logo":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/learnlearn.uk\/ibcs\/#\/schema\/logo\/image\/","url":"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2022\/09\/LearnLearnLogowhite-300x41.png","contentUrl":"https:\/\/learnlearn.uk\/ibcs\/wp-content\/uploads\/sites\/25\/2022\/09\/LearnLearnLogowhite-300x41.png","width":300,"height":41,"caption":"IB Computer Science"},"image":{"@id":"https:\/\/learnlearn.uk\/ibcs\/#\/schema\/logo\/image\/"}}]}},"rttpg_featured_image_url":null,"rttpg_author":{"display_name":"learnlearnadmin","author_link":"https:\/\/learnlearn.uk\/ibcs\/author\/learnlearnadmin\/"},"rttpg_comment":0,"rttpg_category":null,"rttpg_excerpt":"Data Mining Data mining is the process of discovering patterns, correlations, and trends by sifting through large amounts of data stored in repositories, using various techniques from machine learning, statistics, and database systems. It involves the extraction of hidden predictive information from large databases and is a powerful tool that can help companies focus 
on&hellip;&nbsp;Read&hellip;","_links":{"self":[{"href":"https:\/\/learnlearn.uk\/ibcs\/wp-json\/wp\/v2\/pages\/1057"}],"collection":[{"href":"https:\/\/learnlearn.uk\/ibcs\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/learnlearn.uk\/ibcs\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/learnlearn.uk\/ibcs\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/learnlearn.uk\/ibcs\/wp-json\/wp\/v2\/comments?post=1057"}],"version-history":[{"count":3,"href":"https:\/\/learnlearn.uk\/ibcs\/wp-json\/wp\/v2\/pages\/1057\/revisions"}],"predecessor-version":[{"id":1065,"href":"https:\/\/learnlearn.uk\/ibcs\/wp-json\/wp\/v2\/pages\/1057\/revisions\/1065"}],"wp:attachment":[{"href":"https:\/\/learnlearn.uk\/ibcs\/wp-json\/wp\/v2\/media?parent=1057"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}