Premium Essay

Data Mining Group Project

In: Other Topics

Submitted By LuckyPBF
Words 400
Pages 2
As a group, we chose Data Mining for our project and in specific, an organization very popular and well used, Facebook. Facebook was founded on February 4, 2004 and its mission or purpose was to provide a social media platform. There is not a subscription cost and the platform is freely available to anyone with an internet connection and email address. However, there are advertising tools which one can pay for to ensure your various posts reach more of an audience. For example, you can set up a page specifically to market T-shirts to people and if you want to ensure that many people see it, you can pay Facebook to “push” your post to the maximum amount of people who are likely to be interested in such a product.

The services offered are:

1.Communication with friends and family
2.Advertising
3.Marketing
4.Sharing information

We chose this organization because it is a very common and popular organization that most of us use and have interest in. Additionally, data mining has become something the general public is aware of because of people like Edward Snowden, formerly of the NSA, and Julian Assange of WikiLeaks.

Data mining is the collection of numbers or just information in general. This information is collected for statistical purposes. The data that is collected is processed through mega computers. These computers sift through information and arrange it in a much easier to understand format. Data mining software is one of a number of analytical tools for analyzing data. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases. It allows users to categorize information from many different angles to summarize the relationship it identifies.

To this end, Facebook captures user data using various algorithms to optimize browsing experience based on location,…...

Similar Documents

Premium Essay

Data Mining

...Data Mining Jenna Walker Dr. Emmanuel Nyeanchi Information Systems Decision Making May 30, 2012 Abstract Businesses are utilizing techniques such as data mining to create a competitive advantage customer loyalty. Data mining allows business to analyze customer information, such as demographics and purchase history for a better understanding of what the customers need and what they will respond to. Data mining currently takes place in several industries, and will only become even more widespread as the benefits are endless. The purpose of this paper is to gain research and examine data mining, its benefits to businesses, and issues or concerns it will need to overcome. Real world case studies of how data mining is used will also be presented for a deeper understanding. This study will show that despite its disadvantages, data mining is an important step for a business to better understand its customers, and is the future of business marking and operational planning. Tools and Benefits of data mining Before examining the benefits of data mining, it is important to understand what data mining is exactly. Data mining is defined as “a process that uses statistical, mathematical, artificial intelligence, and machine-learning techniques to extract and identify useful information and subsequent knowledge from large databases, including data warehouses” (Turban & Volonino, 2011). The information identified using data mining includes patterns indicating......

Words: 1900 - Pages: 8

Premium Essay

Data Mining

...Data Mining Prepared by: Kirsten Sullivan Strayer University CIS 500 Dr. Baab September 9, 2012 Data mining is a concept that companies use to gain new customers or clients in an effort to make their business and profits grow. The ability to use data mining can result in the accrual of new customers by taking the new information and advertising to customers who are either not currently utilizing the business's product or also in winning additional customers that may be purchasing from the competitor. Generally, data are any “facts, numbers, or text that can be processed by a computer.”1 Today, organizations are accumulating vast and growing amounts of data in different formats and different databases. This includes operational or transactional data such as, sales, cost, inventory, payroll, and accounting. Data mining also known as “knowledge discovery”, is the process of analyzing data from different perspectives and summarizing it into useful information- information that can then be used to increase revenue, cuts costs, and continue the goals outlined for the company. Data mining consists of five major elements: “Extract, transform, and load transaction data onto the data warehouse system, store and manage the data in a multidimensional database system, provide data access to business analysts and information technology professionals, analyze the data by application software, present the data in a useful format, such as a graph or......

Words: 1778 - Pages: 8

Premium Essay

Data Mining

...Data Mining 0. Abstract With the development of different fields, artificial intelligence, machine learning, statistic, database, pattern recognition and neurocomputing they merge to a newly technology, the data mining. The ultimate goal of data mining is to obtain knowledge from the large database. It helps to discover previously unknown patterns, most of the time it is followed by deeper manual evaluation to explain and correlate the results to establish a new knowledge. It is often practically used by government, bank, insurance company and medical researcher. A general basic idea of data mining would be introduced. In this article, they are divided into four types, predictive modeling, database segmentation, link analysis and deviation detection. A brief introduction will explain the variation among them. For the next part, current privacy, ethical as well as technical issue regarding data mining will be discussed. Besides, the future development trends, especially concept of the developing sport data mining is written. Last but not the least different views on data mining including the good side, the drawback and our views are integrated into the paragraph. 1. Introduction This century, is the age of digital world. We are no longer able to live without the computing technology. Due to information explosion, we are having difficulty to obtain knowledge from large amount of unorganized data. One of the solutions, Knowledge Discovery in Database (KDD) is......

Words: 1700 - Pages: 7

Premium Essay

Data Mining

...A data warehouse is a subject-oriented, integrated, time-variant, nonupdateable collection of data used to support management decision-making and business intelligence (Hoffer, 2011). Business Intelligence (BI) is a term that describes a comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate, store and analyze data with the purpose of generating and presenting information to support decision making (Coronel, Morris, & Rob, 2013). Data Warehouse A data warehouse enables an organization to obtain the information about future trends and track customer demands. The key terms that define data warehouse are subject-oriented, integrated, time-variant, and nonupdateable. Each one has its meaning and importance in data warehousing. Subject-oriented – A data warehouse is organized around the key subjects that may include but not limited to customers, patients, students, products, and time. Integrated – The data in the data warehouse are defined using consistent naming conventions, formats, structures, and related characteristics. This means data warehouse holds one version of “the truth”. Time-variant – Data in the data warehouse contain a time dimension so they could be used to study trends and changes. Nonupdateable – Once the data gets loaded into the data warehouse, it could not be updated by the end users. Data warehousing is a process where organizations create and maintain data warehouses and extract......

Words: 1390 - Pages: 6

Premium Essay

Data Mining

...Data mining is an iterative process of selecting, exploring and modeling large amounts of data to identify meaningful, logical patterns and relationships among key variables.  Data mining is used to uncover trends, predict future events and assess the merits of various courses of action.             When employing, predictive analytics and data mining can make marketing more efficient. There are many techniques and methods, including business intelligence data collection. Predictive analytics is using business intelligence data for forecasting and modeling. It is a way to use predictive analysis data to predict future patterns. It is used widely in the insurance, medical and credit industries. Assessment of credit, and assignment of a credit score is probably the most widely known use of predictive analytics. Using events of the past, managers are able to estimate the likelihood of future events. Data mining aids predictive analysis by providing a record of the past that can be analyzed and used to predict which customers are most likely to renew, purchase, or purchase related products and services. Business intelligence data mining is important to your marketing campaigns. Proper data mining algorithms and predictive modeling can narrow your target audience and allow you to tailor your ads to each online customer as he or she navigates your site. Your marketing team will have the opportunity to develop multiple advertisements based on the past clicks of your visitors.......

Words: 1136 - Pages: 5

Premium Essay

Data Mining

...Data Mining Objectives: Highlight the characteristics of Data mining Operations, Techniques and Tools. A Brief Overview Online Analytical Processing (OLAP): OLAP is the dynamic synthesis, analysis, and consolidation of large volumns of multi-dimensional data. Multi-dimensional OLAP support common analyst operations, such as: ▪ Considation – aggregate of data, e.g. roll-ups from branches to regions. ▪ Drill-down – showing details, just the reverse of considation. ▪ Slicing and dicing – pivoting. Looking at the data from different viewpoints. E.g. X, Y, Z axis as salesman, Nth quarter and products, or region, Nth quarter and products. A Brief Overview Data Mining: Construct an advanced architecture for storing information in a multi-dimension data warehouse is just the first step to evolve from traditional DBMS. To realize the value of a data warehouse, it is necessary to extract the knowledge hidden within the warehouse. Unlike OLAP, which reveal patterns that are known in advance, Data Mining uses the machine learning techniques to find hidden relationships within data. So Data Mining is to ▪ Analyse data, ▪ Use software techniques ▪ Finding hidden and unexpected patterns and relationships in sets of data. Examples of Data Mining Applications: ▪ Identifying potential credit card customer groups ▪ Identifying buying patterns of customers. ▪ Predicting trends of......

Words: 1258 - Pages: 6

Premium Essay

Data Warehousing and Data Mining

...Introduction 2 Assumptions 3 Data Availability 3 Overnight processing window 3 Business sponsor 4 Source system knowledge 4 Significance 5 Data warehouse 6 ETL: (Extract, Transform, Load) 6 Data Mining 6 Data Mining Techniques 7 Data Warehousing 8 Data Mining 8 Technology in Health Care 9 Diseases Analysis 9 Treatment strategies 9 Healthcare Resource Management 10 Customer Relationship Management 10 Recommended Solution 11 Corporate Solution 11 Technological Solution 11 Justification and Conclusion 12 References 14 Health Authority Data (Appendix A) 16 Data Warehousing Implementation (Appendix B) 19 Data Mining Implementation (Appendix B) 22 Technological Scenarios in Health Authorities (Appendix C) 26 Technology Tools 27 Data Management Technology Introduction The amount of information offered to us is literally astonishing, and the worthiness of data as an organizational asset is widely acknowledged. Nonetheless the failure to manage this enormous amount of data, and to swiftly acquire the information that is relevant to any particular question, as the volume of information rises, demonstrates to be a distraction and a liability, rather than an asset. This paradox energies the need for increasingly powerful and flexible data management systems. To achieve efficiency and a great level of productivity out of large and complex datasets, operators need have tools that streamline the tasks of managing the data and......

Words: 8284 - Pages: 34

Free Essay

Data Mining

...A Statistical Perspective on Data Mining Ranjan Maitra∗ Abstract Technological advances have led to new and automated data collection methods. Datasets once at a premium are often plentiful nowadays and sometimes indeed massive. A new breed of challenges are thus presented – primary among them is the need for methodology to analyze such masses of data with a view to understanding complex phenomena and relationships. Such capability is provided by data mining which combines core statistical techniques with those from machine intelligence. This article reviews the current state of the discipline from a statistician’s perspective, illustrates issues with real-life examples, discusses the connections with statistics, the differences, the failings and the challenges ahead. 1 Introduction The information age has been matched by an explosion of data. This surfeit has been a result of modern, improved and, in many cases, automated methods for both data collection and storage. For instance, many stores tag their items with a product-specific bar code, which is scanned in when the corresponding item is bought. This automatically creates a gigantic repository of information on products and product combinations sold. Similar databases are also created by automated book-keeping, digital communication tools or by remote sensing satellites, and aided by the availability of affordable and effective storage mechanisms – magnetic tapes, data warehouses and so on. This has created a......

Words: 22784 - Pages: 92

Premium Essay

Data Mining and Data Warehousing

...Data mining and warehousing and its importance in the organization * Data Mining Data mining is the process of analyzing data from different perspectives and summarizing it into useful information - information that can be used to increase revenue, cuts costs, or both. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases. Data mining is primarily used today by companies with a strong consumer focus - retail, financial, communication, and marketing organizations. It enables these companies to determine relationships among internal factors such as price, product positioning, or staff skills, and external factors such as economic indicators, competition, and customer demographics. And, it enables them to determine the impact on sales, customer satisfaction, and corporate profits. Finally, it enables them to drill down into summary information to view detail transactional data. For example, “Entertainers Incorporated” is an organization which deals with entertainers for events. So the need to attract customers and communicating with them is essential. Customer satisfaction in their service is much needed for them, for the customers to approach them for the next event too. So considering......

Words: 1344 - Pages: 6

Premium Essay

Data Mining

...Assignment 4: Data Mining CIS 500 Professor: Dr. Edwin Otto Strayer University August 30, 2013 “Data mining is a process that uses statistical, mathematical, artificial intelligence, and machine learning techniques to extract and identify useful information and subsequent knowledge from large databases, including data warehouses” (Turban, 2011). Predictive analytics serves as a benefit of data mining because it’s a process that uses machine learning to analyze data and make predictions. This can be beneficial to a business because it can be helpful in understanding the behavior of customers. A good example of this would be a business using predictive analytics to decide what level of pricing should be used in correlation with sales information. A business could look at historical data for products, sales, and customers to determine the price for a given product and customer at the right time. Amazon is a heavy user of predictive pricing (Mehra, 2013). This technique is also used in Supply Chain Management because it helps you to understand consumer demand to manage the overall process. This includes delivery, returns, forecasting, sourcing, planning, and order fulfillment. The advantage is if a retailer can predict revenue from a specific product in a reasonable amount of time it will result in better inventory management, use of space, cash flow, and the elimination of out of stock items. Association discovery in products sold to customers is used......

Words: 1499 - Pages: 6

Premium Essay

Data Mining

...Data Mining By: Holly Gildea CIS 500 Dr. Janet Durgin June 09, 2013 Data Mining We learn that data mining is a method of evaluating data from different viewpoints and summarizing it into useful information. Such information can be beneficial and used to increase things like revenue, and cutting costs, and so on. There are four categories that we will look at and determine the benefits for in regards to data mining: predictive analytics to understand the behavior of customers, associations discovery in products sold to customers, web mining to discover business intelligence from web customers, and clustering to find related customer information. To understand the behavior of customers by the use predictive analytics we must first understand what predictive analytics is. “Predictive analytics is the process of dealing with a variety of data and applying various mathematical formulas to discover the best decision for a given situation” (ArticleSnatch, 2011). This gives any business a competitive edge and helps to remove the guess work out of the decision making process therefore helping to find the right solution in a shorter amount of time. In order to find the solution faster there are a seven simple steps that must be worked thru first: what is the problem for the company, searching for multiple data resources, take the patterns that are observed from that data, creating a model that contains the problem and the data, categorize the data and find......

Words: 1843 - Pages: 8

Premium Essay

Data Mining

...ITT TECHNICAL INSTITUTE DATA MINING REPORT ONE AND TWO Shirly Mu, Todd Hughson, James Wall PROBLEM SOLVING THEORY ONSITE COURSE GS1140 INSTUCTOR LJILJANA MORRIS Today we are doing a project report on Costco. For Sol and Robert Price in 1976 they asked friends and family to help out with an opening price of two point five millon to open Price Club on July twelfth, they open their shop in an air hanger on Boulevard in San Diego, California. They were originally going to serve only small business. Mr. Price found out that it will be more beneficial to serve select customers. Costco was founded by James Sinegal and Jeffery H. Brotman. Costco opened its doors in 1983 in Seattle, Washington. Price Club and Costco later merged and renamed the business PriceCostco. And in 1997 due to its success the name was changed again to Costco Wholesale. (About History, 2014) Costco Wholesale stores headquarters are located in Issaquah, WA. The mission statement for Costco is to continually provide our members with quality goods and services at the lowest possible prices. (Farfan, 2014) There are three types of membership cards at Costco; Executive membership, Business membership and Gold membership. The one I have is executive membership this cost about one hundred and twenty dollars sounds like a lot but I’ll be able to get two percent back in my shopping. If I do not get back more than fifty five dollars back for the entire year they will give me that amount or the two percent......

Words: 1306 - Pages: 6

Premium Essay

Data Mining

...1. Define data mining. Why are there many different names and definitions for data mining? Data mining is the process through which previously unknown patterns in data were discovered. Another definition would be “a process that uses statistical, mathematical, artificial intelligence, and machine learning techniques to extract and identify useful information and subsequent knowledge from large databases.” This includes most types of automated data analysis. A third definition: Data mining is the process of finding mathematical patterns from (usually) large sets of data; these can be rules, affinities, correlations, trends, or prediction models. Data mining has many definitions because it’s been stretched beyond those limits by some software vendors to include most forms of data analysis in order to increase sales using the popularity of data mining. What recent factors have increased the popularity of data mining? Following are some of most pronounced reasons: * More intense competition at the global scale driven by customers’ ever-changing needs and wants in an increasingly saturated marketplace. * General recognition of the untapped value hidden in large data sources. * Consolidation and integration of database records, which enables a single view of customers, vendors, transactions, etc. * Consolidation of databases and other data repositories into a single location in the form of a data warehouse. * The exponential......

Words: 4581 - Pages: 19

Premium Essay

Data Mining

...Lawrence Bandy Leslin Cruz Wen. P.M. Feb. 15 2016 General Data Mining (Part1) * What is data mining and how can it benefit/ not benefit society? Data mining is a technique that is used to analyze and collect data from different area of everyone life. Also Data mining gathers mathematics, genetics and marketing to analyze data from different dimensions or angles to put in an organize graph or data sheet for research proposes. It can benefit society by organize a data sheet for mangers or bosses of a company that needs to purchase products to see what is the most selling item that needs stocked. It also could not benefit society if the personal in charge will not take the time and effort to put the right information into the data base. * Will it ultimately lead to behavior control? Why and How? The behavior control will not ultimately effect the individual. The reason is that the person is entitled to their own decisions on purchasing what they want. How, no one can decide for you. You control your own money and spend it the way you want to. * What is the Clustering Analysis? Is a way of grouping a set of objects in a way that objects in the same group are very similar to each other than those in other groups. * What is Anomaly or Outlier Detection? Is the identification of items, events or observations which do not conform to an expected pattern or other items in a dataset. * What is the Association Rule? It is a method to intended to......

Words: 711 - Pages: 3

Premium Essay

Data Mining

...The increasing use of data mining by corporations and advertisers obviously creates apprehension from the perspective of the individual consumer due to privacy, security and the potential use of inaccurate information. The idea that there are data warehouses that contain customers’ personal information can be rather frightening. However, the use of data mining by these organizations can also lead to numerous benefits for consumers they otherwise would not have realized. Besides the obvious benefit of guiding consumers to products or services they’d be more interested in purchasing, the use of data mining by companies has also benefitted individuals’ health and financial safety. Not long after the use of data mining came into prominence the use of data mining consumer information vs. consumer privacy became a major issue in early 1998 after CVS and Giant entered into an agreement with Elensys, a Massachusetts direct marketing company, to send reminders to customers who had not renewed their prescriptions. However, neither CVS nor Giant explained how the program would work or received their customers' permission to transfer their prescription records to a third party (Pyatt Jr.). Giant and CVS’s rationale for entering into this agreement was “to develop a positive program to benefit consumers, many of whom don't take their medication properly,” (Pyatt Jr.). Even though their primary intention was good, Giant and CVS were not transparent about their agreement with......

Words: 949 - Pages: 4