{"appState":{"pageLoadApiCallsStatus":true},"categoryState":{"relatedCategories":{"headers":{"timestamp":"2025-04-17T16:01:06+00:00"},"categoryId":33578,"data":{"title":"Big Data","slug":"big-data","image":{"src":null,"width":0,"height":0},"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"parentCategory":{"categoryId":33577,"title":"Data Science","slug":"data-science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"}},"childCategories":[],"description":"What's the biggest dataset you can imagine? Well, multiply that by a thousand and you're probably still not close to the mammoth piles of info that big data pros process. Learn all about it here.","relatedArticles":{"self":"https://dummies-api.dummies.com/v2/articles?category=33578&offset=0&size=5"},"hasArticle":true,"hasBook":true,"articleCount":174,"bookCount":3},"_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"relatedCategoriesLoadedStatus":"success"},"listState":{"list":{"count":10,"total":174,"items":[{"headers":{"creationTime":"2023-12-01T15:39:09+00:00","modifiedTime":"2023-12-01T15:39:09+00:00","timestamp":"2023-12-01T18:01:09+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Beyond Boundaries: Unstructured Data Orchestration","strippedTitle":"beyond boundaries: unstructured data orchestration","slug":"beyond-boundaries-unstructured-data-orchestration","canonicalUrl":"","seo":{"metaDescription":"Find out what data orchestration is and how an unstructured data architecture can work for you.","noIndex":0,"noFollow":0},"content":"Getting the most out of your unstructured data is an essential task for any organization these days, especially when considering the disparate storage systems, applications, and user locations. So, it’s not an accident that data orchestration is the term that brings everything together.\r\n\r\nBringing all your data together shares similarities with conducting an orchestra. Instead of combining the violin, oboe, and cello, this brand of orchestration combines distributed data types from different places, platforms, and locations working as a cohesive entity presented to applications or users anywhere. That’s because historically, accessing high-performance data outside of your computer network was inefficient. Because the storage infrastructure existed in a silo, systems like HPC Parallel (which lets users store and access shared data across multiple networked storage nodes), Enterprise NAS (which allows large-scale storage and access to other networks), and Global Namespace (virtually simplifies network file systems) were limited when it came to sharing. Because each operated independently, the data within each system was siloed making it a problem collaborating with data sets over multiple locations.\r\n\r\nCollaboration was possible, but too often you lost the ability to have high performance. This Boolean logic decreased potential because having an IT architecture that supported both high performance and collaboration with data sets from different storage silos typically became an either/or decision: You were forced to choose one but never both.\r\n<h2 id=\"tab1\" >What is data orchestration?</h2>\r\nData orchestration is the automated process of taking siloed data from multiple data storage systems and locations, combining and organizing it into a single namespace. Then a high-performance file system can place data in the edge service, data center, or cloud service most optimal for the workload.\r\n\r\nThe recent rise of data analytic applications and artificial intelligence (AI) capabilities has accelerated the use of data across different locations and even different organizations. In the next data cycle, organizations will need both high-performance and agility with their data to compete and thrive in a competitive environment.\r\n\r\nThat means data no longer has a 1:1 relationship with the applications and compute environment that generated it. It needs to be used, analyzed, and repurposed with different AI models and alternate workloads, and across a remote, collaborative environment.\r\n<p class=\"remember\">Hammerspace’s technology makes data available to different foundational models, remote applications, decentralized compute clusters, and remote workers to automate and streamline data-driven development programs, data insights, and business decision making. This capability enables a unified, fast, and efficient global data environment for the entire workflow — from data creation to processing, collaboration, and archiving across edge devices, data centers, and public and private clouds.</p>\r\n\r\nControl of enterprise data services for governance, security, data protection, and compliance can now be implemented globally at a file-granular level across all storage types and locations. Applications and AI models can access data stored in remote locations while using automated orchestration tools to provide high-performance local access when needed for processing. Organizations can grow their talent pools with access to team members no matter where they reside.\r\n<h2 id=\"tab2\" >Decentralizing the data center</h2>\r\nData collection has become more prominent, and the traditional system of centralized data management has limitations. Issues of centralized data storage can limit the amount of data available to applications. Then, there are the high infrastructure costs when multiple applications are needed to manage and move data, multiple copies of data are retained in different storage systems, and more headcount is needed to manage the complex, disconnected infrastructure environment. Such setbacks suggest that the data center is no longer the center of data and storage system constraints should no longer define data architectures.\r\n\r\nHammerspace specializes in decentralized environments, where data may need to span two or more sites and possibly one or more cloud providers and regions, and/or where a remote workforce needs to collaborate in real time. It enables a global data environment by providing a unified, parallel global file system.\r\n<h2 id=\"tab3\" >Enabling a global data environment</h2>\r\nHammerspace completely revolutionizes previously held notions of how unstructured data architectures should be designed, delivering the performance needed across distributed environments to\r\n<ul>\r\n \t<li>Free workloads from data silos.</li>\r\n \t<li>Eliminate copy proliferation.</li>\r\n \t<li>Provide direct data access through local metadata to applications and users, no matter where the data is stored.</li>\r\n</ul>\r\nThis technology allows organizations to take full advantage of the performance capabilities of any server, storage system, and network anywhere in the world. This capability enables a unified, fast, and efficient global data environment for the entire workflow, from data creation to processing, collaboration, and archiving across edge devices, data centers, and public and private clouds.\r\n\r\nThe days of enterprises struggling with a siloed, distributed, and inefficient data environment are over. It’s time to start expecting more from data architectures with automated data orchestration. Find out how by downloading <em>Unstructured Data Orchestration For Dummies,</em> Hammerspace Special Edition, <a class=\"bookSponsor-btn\" href=\"https://hammerspace.com/for-dummies/\" target=\"_blank\" rel=\"noopener\" data-testid=\"bookSponsorDownloadButton\">here</a>.","description":"Getting the most out of your unstructured data is an essential task for any organization these days, especially when considering the disparate storage systems, applications, and user locations. So, it’s not an accident that data orchestration is the term that brings everything together.\r\n\r\nBringing all your data together shares similarities with conducting an orchestra. Instead of combining the violin, oboe, and cello, this brand of orchestration combines distributed data types from different places, platforms, and locations working as a cohesive entity presented to applications or users anywhere. That’s because historically, accessing high-performance data outside of your computer network was inefficient. Because the storage infrastructure existed in a silo, systems like HPC Parallel (which lets users store and access shared data across multiple networked storage nodes), Enterprise NAS (which allows large-scale storage and access to other networks), and Global Namespace (virtually simplifies network file systems) were limited when it came to sharing. Because each operated independently, the data within each system was siloed making it a problem collaborating with data sets over multiple locations.\r\n\r\nCollaboration was possible, but too often you lost the ability to have high performance. This Boolean logic decreased potential because having an IT architecture that supported both high performance and collaboration with data sets from different storage silos typically became an either/or decision: You were forced to choose one but never both.\r\n<h2 id=\"tab1\" >What is data orchestration?</h2>\r\nData orchestration is the automated process of taking siloed data from multiple data storage systems and locations, combining and organizing it into a single namespace. Then a high-performance file system can place data in the edge service, data center, or cloud service most optimal for the workload.\r\n\r\nThe recent rise of data analytic applications and artificial intelligence (AI) capabilities has accelerated the use of data across different locations and even different organizations. In the next data cycle, organizations will need both high-performance and agility with their data to compete and thrive in a competitive environment.\r\n\r\nThat means data no longer has a 1:1 relationship with the applications and compute environment that generated it. It needs to be used, analyzed, and repurposed with different AI models and alternate workloads, and across a remote, collaborative environment.\r\n<p class=\"remember\">Hammerspace’s technology makes data available to different foundational models, remote applications, decentralized compute clusters, and remote workers to automate and streamline data-driven development programs, data insights, and business decision making. This capability enables a unified, fast, and efficient global data environment for the entire workflow — from data creation to processing, collaboration, and archiving across edge devices, data centers, and public and private clouds.</p>\r\n\r\nControl of enterprise data services for governance, security, data protection, and compliance can now be implemented globally at a file-granular level across all storage types and locations. Applications and AI models can access data stored in remote locations while using automated orchestration tools to provide high-performance local access when needed for processing. Organizations can grow their talent pools with access to team members no matter where they reside.\r\n<h2 id=\"tab2\" >Decentralizing the data center</h2>\r\nData collection has become more prominent, and the traditional system of centralized data management has limitations. Issues of centralized data storage can limit the amount of data available to applications. Then, there are the high infrastructure costs when multiple applications are needed to manage and move data, multiple copies of data are retained in different storage systems, and more headcount is needed to manage the complex, disconnected infrastructure environment. Such setbacks suggest that the data center is no longer the center of data and storage system constraints should no longer define data architectures.\r\n\r\nHammerspace specializes in decentralized environments, where data may need to span two or more sites and possibly one or more cloud providers and regions, and/or where a remote workforce needs to collaborate in real time. It enables a global data environment by providing a unified, parallel global file system.\r\n<h2 id=\"tab3\" >Enabling a global data environment</h2>\r\nHammerspace completely revolutionizes previously held notions of how unstructured data architectures should be designed, delivering the performance needed across distributed environments to\r\n<ul>\r\n \t<li>Free workloads from data silos.</li>\r\n \t<li>Eliminate copy proliferation.</li>\r\n \t<li>Provide direct data access through local metadata to applications and users, no matter where the data is stored.</li>\r\n</ul>\r\nThis technology allows organizations to take full advantage of the performance capabilities of any server, storage system, and network anywhere in the world. This capability enables a unified, fast, and efficient global data environment for the entire workflow, from data creation to processing, collaboration, and archiving across edge devices, data centers, and public and private clouds.\r\n\r\nThe days of enterprises struggling with a siloed, distributed, and inefficient data environment are over. It’s time to start expecting more from data architectures with automated data orchestration. Find out how by downloading <em>Unstructured Data Orchestration For Dummies,</em> Hammerspace Special Edition, <a class=\"bookSponsor-btn\" href=\"https://hammerspace.com/for-dummies/\" target=\"_blank\" rel=\"noopener\" data-testid=\"bookSponsorDownloadButton\">here</a>.","blurb":"","authors":[{"authorId":9204,"name":"John Carucci","slug":"john-carucci","description":" <p><b>John Carucci </b>is not a celebrity, though he certainly brushes up against the stars of stage and screen on a regular basis in his role as an Entertainment TV Producer with the Associated Press. Along with hobnobbing with actors and musicians, John is also author of <i>Digital SLR Video & Filmmaking For Dummies</i> and two editions of <i>GoPro Cameras For Dummies</i>.</p> ","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9204"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[{"label":"What is data orchestration?","target":"#tab1"},{"label":"Decentralizing the data center","target":"#tab2"},{"label":"Enabling a global data environment","target":"#tab3"}],"relatedArticles":{"fromBook":[],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":0,"slug":null,"isbn":null,"categoryList":null,"amazon":null,"image":null,"title":null,"testBankPinActivationLink":null,"bookOutOfPrint":false,"authorsInfo":null,"authors":null,"_links":null},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[null]},{&quot;key&quot;:&quot;sponsored&quot;,&quot;values&quot;:[&quot;customsolutions&quot;]}]\" id=\"du-slot-656a1f65440bd\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[null]},{&quot;key&quot;:&quot;sponsored&quot;,&quot;values&quot;:[&quot;customsolutions&quot;]}]\" id=\"du-slot-656a1f6544d68\"></div></div>"},"articleType":{"articleType":"Articles","articleList":null,"content":null,"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":true,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"Brought to you by Hammerspace","brandingLink":"https://hammerspace.com/","brandingLogo":{"src":"https://www.dummies.com/wp-content/uploads/hammerspace-logo-266x55-1.png","width":266,"height":55},"sponsorAd":"","sponsorEbookTitle":"Unstructured Data Orchestration For Dummies, Hammerspace Special Edition","sponsorEbookLink":"https://hammerspace.com/for-dummies/","sponsorEbookImage":{"src":"https://www.dummies.com/wp-content/uploads/unstructured-data-orchestration-hammerspace-special-edition-cover-9781394211364-165x255.jpg","width":165,"height":255}},"primaryLearningPath":"Solve","lifeExpectancy":"One year","lifeExpectancySetFrom":"2023-12-05T00:00:00+00:00","dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[{"adPairKey":"sponsored","adPairValue":"customsolutions"}]},"status":"publish","visibility":"public","articleId":301250},{"headers":{"creationTime":"2016-03-27T16:46:51+00:00","modifiedTime":"2022-04-12T20:12:23+00:00","timestamp":"2022-09-14T18:19:36+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Big Data for Small Business For Dummies Cheat Sheet","strippedTitle":"big data for small business for dummies cheat sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","canonicalUrl":"","seo":{"metaDescription":"Discover the key terminology you need to understand the crucial big data skills for businesses and how to communicate data to your company.","noIndex":0,"noFollow":0},"content":"Big data makes big headlines, but it’s much more than just a buzz phrase or the latest business fad. The phenomenon is very real and it’s producing concrete benefits in so many different areas – particularly in business. Here you will get to the heart of big data as a business owner or manager: You will take a look at the key terminology you need to understand the crucial big data skills for businesses, ten steps to using big data to make better decisions, and tips for communicating insights from data to your colleagues.","description":"Big data makes big headlines, but it’s much more than just a buzz phrase or the latest business fad. The phenomenon is very real and it’s producing concrete benefits in so many different areas – particularly in business. Here you will get to the heart of big data as a business owner or manager: You will take a look at the key terminology you need to understand the crucial big data skills for businesses, ten steps to using big data to make better decisions, and tips for communicating insights from data to your colleagues.","blurb":"","authors":[{"authorId":9052,"name":"Bernard Marr","slug":"bernard-marr","description":" <p><b>Bernard Marr</b> is a bestselling author on organisational performance and business success. He regularly advises leading companies, organisations and governments across the globe, and is acknowledged by the CEO Journal as one of today&#8217;s leading business brains. He has advised the Bank of England, Barclays, BP, Fujitsu, HSBC, Mars and others.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9052"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":34253,"title":"General Small Business","slug":"general-small-business","_links":{"self":"https://dummies-api.dummies.com/v2/categories/34253"}},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":140207,"title":"10 Big Data Predictions for the Future","slug":"10-big-data-predictions-for-the-future","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140207"}},{"articleId":140196,"title":"Big Data: Starting with Strategy","slug":"big-data-starting-with-strategy","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140196"}},{"articleId":140195,"title":"Overcoming the Big Data Skills Shortage","slug":"overcoming-the-big-data-skills-shortage","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140195"}},{"articleId":140190,"title":"Understanding Big Data and the Internet of Things","slug":"understanding-big-data-and-the-internet-of-things","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140190"}},{"articleId":140156,"title":"6 Key Big Data Skills Every Business Needs","slug":"6-key-big-data-skills-every-business-needs","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140156"}}],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":281550,"slug":"big-data-for-small-business-for-dummies","isbn":"9781119027034","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1119027039/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1119027039/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1119027039-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1119027039/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1119027039/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/big-data-for-small-business-for-dummies-cover-9781119027034-203x255.jpg","width":203,"height":255},"title":"Big Data For Small Business For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"9052\">Bernard Marr</b> helps companies to better manage, measure, report and analyse performance. His leading-edge work with major companies, organisations and governments across the globe makes him an acclaimed and award-winning keynote speaker, researcher, consultant and teacher. </p>","authors":[{"authorId":9052,"name":"Bernard Marr","slug":"bernard-marr","description":" <p><b>Bernard Marr</b> is a bestselling author on organisational performance and business success. He regularly advises leading companies, organisations and governments across the globe, and is acknowledged by the CEO Journal as one of today&#8217;s leading business brains. He has advised the Bank of England, Barclays, BP, Fujitsu, HSBC, Mars and others.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9052"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781119027034&quot;]}]\" id=\"du-slot-63221b38b4d7b\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781119027034&quot;]}]\" id=\"du-slot-63221b38b56d8\"></div></div>"},"articleType":{"articleType":"Cheat Sheet","articleList":[{"articleId":140154,"title":"Understanding Big Data Jargon","slug":"understanding-big-data-jargon","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140154"}},{"articleId":140156,"title":"6 Key Big Data Skills Every Business Needs","slug":"6-key-big-data-skills-every-business-needs","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140156"}},{"articleId":140157,"title":"10 Steps to Using Data to Improve Business Decisions","slug":"10-steps-to-using-data-to-improve-business-decisions","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140157"}},{"articleId":140155,"title":"How to Communicate Insights from Big Data","slug":"how-to-communicate-insights-from-big-data","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/140155"}}],"content":[{"title":"Understanding big data jargon","thumb":null,"image":null,"content":"<p>The technical jargon surrounding big data can seem a little daunting at first. The key phrases and terms you’re likely to come across, with easy-to-understand definitions for each, follow:</p>\n<ul class=\"level-one\">\n<li>\n<p class=\"first-para\"><b>Big data: </b>Increasingly, everything you do leaves a digital trace (or data), which you (and others) can use and analyse. The phrase <i>big data</i> refers to that data being collected and the ability to make use of it.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Big data analytics:</b><i> </i>This is the process of collecting, processing and analysing data to generate insights that inform fact-based decision making. In many cases it involves software-based analysis using algorithms.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Algorithm: </b>A mathematical formula or statistical process run by software to analyse data. It usually involves multiple calculation steps and can be used to automatically process data or solve problems.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Cloud computing:</b><i> </i>Software or data running on remote servers, rather than locally. So instead of storing or computing things on your own machine, you can use other computers that are connected to your computer via a network (such as the Internet).</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Structured data:</b><i> </i>Any data or information located in a fixed field within a defined record or file, such as a database or spreadsheet. Its inherent structure makes it quick, easy and cheap to analyse.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Unstructured data:</b><i> </i>All the data not easily stored and indexed in traditional formats or databases. It includes email conversations, social media posts, video content, photos, voice recordings, sounds and so on. Its lack of structure makes it more difficult to analyse using traditional computer programs.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Semi-structured data:</b><i> </i>You guessed it, this is a cross between unstructured and structured data. It’s data that may have some structure that can be used for analysis but lacks the strict structure found in databases or spreadsheets. For example, a Facebook post can be categorised by author, date, length and even sentiment, but the content is generally unstructured.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Internal data: </b>This accounts for all the data your business currently has or could potentially access or generate in future. It could be structured in format (for example, a customer database) or it could be unstructured (conversational data from customer service calls).</p>\n</li>\n<li>\n<p class=\"first-para\"><b>External data: </b>Put simply, this is the infinite array of information that exists outside your business. It can be publically available or privately held and it can also be structured or unstructured in format.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>The Internet of Things:</b><i> </i>A network that connects devices (the <i>things</i> referred to in the name) so that they can communicate with each other. This encompasses technology like smart televisions, smart phones, and sensors, and it’s all possible thanks to the massive increase in connectivity between devices, systems and services.</p>\n</li>\n</ul>\n"},{"title":"6 key big data skills every business needs","thumb":null,"image":null,"content":"<p>What are the key skills required to use big data successfully? The list here includes six key skills that all businesses should develop, either through recruiting data scientists who match these attributes, or by developing these skills in existing employees:</p>\n<ul class=\"level-one\">\n<li>\n<p class=\"first-para\"><b>Analytics:</b> This involves determining which data is relevant to the question you’re hoping to answer and interpreting the data in order to derive those answers. Key skills include a knack for spotting patterns and establishing links, the ability to make sense of a range of data (both structured and unstructured) and a sound knowledge of industry-standard analytics packages like SAS Analytics and Oracle Data Mining.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Creativity:</b> Anyone can be formulaic – you need to aim for innovation that will set your business apart from the pack. Creativity is especially important for any business hoping to make sense of <i>unstructured data</i> – data that doesn’t fit comfortably into tables and charts. Valuable creative skills include a knack for problem solving (perhaps even spotting problems others aren’t yet aware of) and the ability to come up with new ways of gathering and interpreting data.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Maths and statistics: </b>People with a strong background in maths or statistics have a good grounding for big data-related work. You’re looking for at least a basic grasp of statistics and the ability to wrangle messy data into figures that can be quantified so that you can draw conclusions from them.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Computer science:</b> This very broad category covers a whole range of subfields, such as machine learning, databases and cloud computing. It may cover everything from plugging together the cables to creating sophisticated machine learning and natural language processing algorithms. Key skills include a solid understanding of database technology and a firm grasp of technologies such as Hadoop, Java and Python.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Business acumen: </b>People who work with big data need a firm grasp of the company’s goals and objectives, as well as an understanding of whether the business is heading in the right direction. This includes understanding what makes the company tick, what makes it thrive and why it stands out from its competitors (and if it’s not thriving, why it’s not).</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Communication: </b>You can have the best analytical skills in the world, but unless you’re able to present findings in a clear way and demonstrate how they can help to improve performance and drive success, all that analysis will go to waste. Great interpersonal and written communication skills are vital, as is the ability to add value to data through insights and analysis. A knack for storytelling and being able to bring data to life through visualization techniques will also help immensely.</p>\n</li>\n</ul>\n"},{"title":"10 steps to using data to improve business decisions","thumb":null,"image":null,"content":"<p>Data should be at the heart of strategic decision making in business, whether you run a huge multinational or a small family-run business. Big data can provide insights that help you answer your key business questions, such as ‘How can I improve customer satisfaction?’. Data leads to insights; business owners and managers can turn those insights into decisions and actions that improve the business.</p>\n<p class=\"Tip\">Use this ten-step process for making data-based decisions:</p>\n<ol class=\"level-one\">\n<li>\n<p class=\"first-para\">Start with strategy.</p>\n<p class=\"child-para\">Instead of starting with what data you could or should access, start by working out what your business is looking to achieve. In a nutshell, you need to work out what your strategic goals are, for example, increasing your customer base.</p>\n</li>\n<li>\n<p class=\"first-para\">Hone in on the business area; identify your strategic objectives.</p>\n<p class=\"child-para\">Identify the areas most important to achieving your overall strategy. For most businesses, the customer, finance and operations areas are key.</p>\n</li>\n<li>\n<p class=\"first-para\">Identify unanswered questions.</p>\n<p class=\"child-para\">Work out which questions you need to answer in order to achieve those goals. By working out exactly what you need to know, you can focus on the data that you really need.</p>\n</li>\n<li>\n<p class=\"first-para\">Find the data that will help answer those questions.</p>\n<p class=\"child-para\">Focus on identifying the ideal data for you – the data that could help you answer your most pressing questions and deliver on your strategic objectives.</p>\n</li>\n<li>\n<p class=\"first-para\">Identify what data you already have or have access to.</p>\n<p class=\"child-para\">After you identify the data you need, it makes sense to see if you’re already sitting on some of that information, even if it isn’t immediately obvious.</p>\n</li>\n<li>\n<p class=\"first-para\">Work out if the costs and effort are justified.</p>\n<p class=\"child-para\">Only after you know the costs can you work out if the tangible benefits outweigh those costs. In this respect, you should treat data like any other key business investment. You need to make a clear case for the investment that outlines the long-term value of data to the business strategy.</p>\n</li>\n<li>\n<p class=\"first-para\">Collect the data.</p>\n<p class=\"child-para\">Much of this step comes down to setting up the processes and people to gather and manage your data. You may be buying access to an analysis-ready data set, in which case there’s no need to collect data as such. But, in reality, many data projects require some amount of data collection.</p>\n</li>\n<li>\n<p class=\"first-para\">Analyze the data.</p>\n<p class=\"child-para\">You need to analyze the data in order to extract meaningful and useful business insights. After all, there’s no point coming this far if you don’t then discover something new from the data.</p>\n</li>\n<li>\n<p class=\"first-para\">Present and distribute the insights.</p>\n<p class=\"child-para\">Unless the results are presented to the right people at the right time in a meaningful way, then the size of the data sets or the sophistication of the analytics tools don’t really matter. You need to make sure the insights gained from your data are used to inform decision making and, ultimately, improve performance.</p>\n</li>\n<li>\n<p class=\"first-para\">Incorporate the learning into the business.</p>\n<p class=\"child-para\">Finally, you need to apply the insights from the data to your decision making, making the decisions that will transform your business for the better – and then acting on those decisions. For me, this is the most rewarding part of the data journey: turning data into action.</p>\n</li>\n</ol>\n"},{"title":"How to communicate insights from big data","thumb":null,"image":null,"content":"<p>Big data can help you gain insight. Businesses gain competitive advantage when the <i>right information</i> is delivered to the <i>right people</i> at the <i>right time.</i> This means extracting insights and information from data and communicating them to decision makers in a way they’ll easily understand. After all, people are less likely to act if they have to work hard to understand the information in front of them.</p>\n<p class=\"Tip\">Make sure your insights shine through with these top tips:</p>\n<ul class=\"level-one\">\n<li>\n<p class=\"first-para\"><b>Identify your target audience. </b>Who your audience is depends on your strategic questions. The audience may be you if you’re the business owner, or it could be your human resources team, your marketing team or a combination. Ask yourself who’s going to see these results. What do they already know about the issues being discussed? What do they need and want to know? And, what will they do with the information?</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Customise the information for your audience.</b> Be prepared to customise your information to meet the specific requirements of each decision maker.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Remember what you’re trying to achieve.</b> Try not to get distracted by interesting insights that have nothing to do with answering your strategic questions and achieving your business goals. There may be scope to revisit those other insights in future but, for now, focus on what you set out to achieve.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Avoid creating a wall of text.</b> Remember that data can be presented as a number, a short written narrative, a table, a graph or a chart. In fact, the best approach is likely to involve a combination of these formats.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Use data visualisation techniques.</b> Visuals are great for conveying information because they’re quick and direct, they’re (usually) easy to understand, they’re memorable and they add interest, being much more likely to hold the reader’s attention than a full page of text.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>But don’t neglect the text. </b>Numbers, charts and visuals may only give a snapshot; narrative allows you to embellish on key points. Use short narratives to introduce what you’re showing and highlight the key insights.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Use clear headings to make the important points stand out.</b> This way, even at a quick glance, the key points will be obvious.</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Link the information to your strategy. </b>If you’re presenting information that directly answers a strategic business question, such as ‘How do we reduce staff turnover by ten per cent?’, include that question in the opening narrative and maybe even the headline.</p>\n</li>\n</ul>\n"}],"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":"One year","lifeExpectancySetFrom":"2022-04-12T00:00:00+00:00","dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":207432},{"headers":{"creationTime":"2016-03-27T16:47:03+00:00","modifiedTime":"2022-03-10T20:12:30+00:00","timestamp":"2022-09-14T18:19:23+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Statistics for Big Data For Dummies Cheat Sheet","strippedTitle":"statistics for big data for dummies cheat sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","canonicalUrl":"","seo":{"metaDescription":"Overview of the three types of statistical measures: those of central tendency, central dispersion and association.","noIndex":0,"noFollow":0},"content":"Summary statistical measures represent the key properties of a sample or population as a single numerical value. This has the advantage of providing important information in a very compact form. It also simplifies comparing multiple samples or populations. Summary statistical measures can be divided into three types: measures of central tendency, measures of central dispersion, and measures of association.","description":"Summary statistical measures represent the key properties of a sample or population as a single numerical value. This has the advantage of providing important information in a very compact form. It also simplifies comparing multiple samples or populations. Summary statistical measures can be divided into three types: measures of central tendency, measures of central dispersion, and measures of association.","blurb":"","authors":[{"authorId":9080,"name":"Alan Anderson","slug":"alan-anderson","description":" <p><b>Alan Anderson</b>, PhD is a teacher of finance, economics, statistics, and math at Fordham and Fairfield universities as well as at Manhattanville and Purchase colleges. Outside of the academic environment he has many years of experience working as an economist, risk manager, and fixed income analyst. Alan received his PhD in economics from Fordham University, and an M.S. in financial engineering from Polytechnic University.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9080"}},{"authorId":9081,"name":"David Semmelroth","slug":"david-semmelroth","description":" <p><b>David Semmelroth</b> has two decades of experience translating customer data into actionable insights across the financial services, travel, and entertainment industries. David has consulted for Cedar Fair, Wachovia, National City, and TD Bank.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9081"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":142226,"title":"Discrete and Continuous Probability Distributions","slug":"discrete-and-continuous-probability-distributions","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/142226"}},{"articleId":142209,"title":"10 Key Concepts in Hypothesis Testing","slug":"10-key-concepts-in-hypothesis-testing","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/142209"}},{"articleId":142192,"title":"Overview of Graphical Techniques","slug":"overview-of-graphical-techniques","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/142192"}},{"articleId":142191,"title":"Overview of Hypothesis Testing","slug":"overview-of-hypothesis-testing","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/142191"}},{"articleId":142183,"title":"Measures of Association","slug":"measures-of-association","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/142183"}}],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":282602,"slug":"statistics-for-big-data-for-dummies","isbn":"9781118940013","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1118940016/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1118940016/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1118940016-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1118940016/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1118940016/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/statistics-for-big-data-for-dummies-cover-9781118940013-203x255.jpg","width":203,"height":255},"title":"Statistics for Big Data For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"9080\">Alan Anderson, PhD,</b> is a professor of economics and finance at Fordham University and New York University. He's a veteran economist, risk manager, and fixed income analyst.</p> <p><b data-author-id=\"9081\">David Semmelroth</b> is an experienced data analyst, trainer, and statistics instructor who consults on customer databases and database marketing.</p>","authors":[{"authorId":9080,"name":"Alan Anderson","slug":"alan-anderson","description":" <p><b>Alan Anderson</b>, PhD is a teacher of finance, economics, statistics, and math at Fordham and Fairfield universities as well as at Manhattanville and Purchase colleges. Outside of the academic environment he has many years of experience working as an economist, risk manager, and fixed income analyst. Alan received his PhD in economics from Fordham University, and an M.S. in financial engineering from Polytechnic University.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9080"}},{"authorId":9081,"name":"David Semmelroth","slug":"david-semmelroth","description":" <p><b>David Semmelroth</b> has two decades of experience translating customer data into actionable insights across the financial services, travel, and entertainment industries. David has consulted for Cedar Fair, Wachovia, National City, and TD Bank.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9081"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118940013&quot;]}]\" id=\"du-slot-63221b2be3d5d\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118940013&quot;]}]\" id=\"du-slot-63221b2be4729\"></div></div>"},"articleType":{"articleType":"Cheat Sheet","articleList":[{"articleId":142176,"title":"Measures of Central Tendency","slug":"measures-of-central-tendency","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/142176"}},{"articleId":142177,"title":"Measures of Central Dispersion","slug":"measures-of-central-dispersion","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/142177"}},{"articleId":142183,"title":"Measures of Association","slug":"measures-of-association","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/142183"}}],"content":[{"title":"Measures of central tendency","thumb":null,"image":null,"content":"<p>Measures of central tendency show the center of a data set. Three of the most commonly used measures of central tendency are the mean, median, and mode.</p>\n<h2>Mean</h2>\n<p><i>Mean </i>is another word for average. Here is the formula for computing the mean of a sample:</p>\n<p><img loading=\"lazy\" src=\"https://www.dummies.com/wp-content/uploads/484219.image0.jpg\" alt=\"image0.jpg\" width=\"77\" height=\"68\" /></p>\n<p>With this formula, you compute the sample mean by simply adding up all the elements in the sample and then dividing by the number of elements in the sample.</p>\n<p>Here is the corresponding formula for computing the mean of a population:</p>\n<p><img loading=\"lazy\" src=\"https://www.dummies.com/wp-content/uploads/484220.image1.jpg\" alt=\"image1.jpg\" width=\"73\" height=\"68\" /></p>\n<p>Although the notation is slightly different, the procedure for computing a population mean is the same as the procedure for computing a sample mean.</p>\n<p class=\"Tip\">Greek letters are used to describe populations, whereas Roman letters are used to describe samples.</p>\n<h2>Median</h2>\n<p>The <i>median</i> of a data set is a value that divides the data into two equal halves. In other words, half of the elements of a data set are <i>less than </i>the median, and the remaining half are <i>greater than </i>the median. The procedure for computing the median is the same for both samples and populations.</p>\n<h2>Mode</h2>\n<p>The mode of a data set is the most commonly observed value in the data set. You determine the mode in the same way for a sample and a population.</p>\n"},{"title":"Measures of central dispersion","thumb":null,"image":null,"content":"<p>Measures of central dispersion show how &#8220;spread out&#8221; the elements of a data set are from the mean. Three of the most commonly used measures of central dispersion include the following:</p>\n<ul class=\"level-one\">\n<li>\n<p class=\"first-para\">Range</p>\n</li>\n<li>\n<p class=\"first-para\">Variance</p>\n</li>\n<li>\n<p class=\"first-para\">Standard deviation</p>\n</li>\n</ul>\n<h2>Range</h2>\n<p>The <i>range</i> of a data set is the difference between the largest value and the smallest value. You compute it the same way for both samples and populations.</p>\n<h2>Variance</h2>\n<p>You can think of the variance as the average <i>squared</i> difference between the elements of a data set and the mean. The formulas for computing a sample variance and a population variance are slightly different.</p>\n<p>Here is the formula for computing sample variance:</p>\n<p><img loading=\"lazy\" src=\"https://www.dummies.com/wp-content/uploads/484223.image0.jpg\" alt=\"image0.jpg\" width=\"130\" height=\"68\" /></p>\n<p>And here is the formula for computing population variance:</p>\n<p><img loading=\"lazy\" src=\"https://www.dummies.com/wp-content/uploads/484224.image1.jpg\" alt=\"image1.jpg\" width=\"130\" height=\"68\" /></p>\n<h2>Standard deviation</h2>\n<p>The standard deviation is simply the square root of the variance. It&#8217;s more commonly used as a measure of dispersion than the variance because it&#8217;s measured in the same units as the elements of the data set, whereas the variance is measured in <i>squared </i>units.</p>\n"},{"title":"Measures of association","thumb":null,"image":null,"content":"<p>Measures of association quantify the strength and the direction of the relationship between two data sets. Here are the two most commonly used measures of association:</p>\n<ul class=\"level-one\">\n<li>\n<p class=\"first-para\">Covariance</p>\n</li>\n<li>\n<p class=\"first-para\">Correlation</p>\n</li>\n</ul>\n<p>Both measures are used to show how closely two data sets are related to each other. The main difference between them is the units in which they are measured. The correlation measure is defined to assume values between –1 and 1, which makes interpretation very easy.</p>\n<h2>Covariance</h2>\n<p>The <i>covariance</i> between two samples is computed as follows:</p>\n<p><img loading=\"lazy\" src=\"https://www.dummies.com/wp-content/uploads/484213.image0.jpg\" alt=\"image0.jpg\" width=\"189\" height=\"68\" /></p>\n<p>The covariance between two populations is computed as follows:</p>\n<p><img loading=\"lazy\" src=\"https://www.dummies.com/wp-content/uploads/484214.image1.jpg\" alt=\"image1.jpg\" width=\"208\" height=\"68\" /></p>\n<h2>Correlation</h2>\n<p>The <i>correlation</i> between two samples is computed like this:</p>\n<p><img loading=\"lazy\" src=\"https://www.dummies.com/wp-content/uploads/484215.image2.jpg\" alt=\"image2.jpg\" width=\"84\" height=\"44\" /></p>\n<p>The correlation between two populations is computed like this:</p>\n<p><img loading=\"lazy\" src=\"https://www.dummies.com/wp-content/uploads/484216.image3.jpg\" alt=\"image3.jpg\" width=\"94\" height=\"44\" /></p>\n"}],"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":"Five years","lifeExpectancySetFrom":"2022-03-10T00:00:00+00:00","dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":207478},{"headers":{"creationTime":"2016-03-27T16:49:35+00:00","modifiedTime":"2022-02-09T18:46:11+00:00","timestamp":"2022-09-14T18:19:05+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Big Data For Dummies Cheat Sheet","strippedTitle":"big data for dummies cheat sheet","slug":"big-data-for-dummies-cheat-sheet","canonicalUrl":"","seo":{"metaDescription":"Learn the essentials of managing big data, including unstructured data, infrastructure, what Hadoop is, and how to develop a strategy.","noIndex":0,"noFollow":0},"content":"To stay competitive today, companies must find practical ways to deal with big data — that is, to learn new ways to capture and analyze growing amounts of information about customers, products, and services.\r\n\r\nData is becoming increasingly complex in structured and unstructured ways. New sources of data come from machines, such as sensors; social business sites; and website interaction, such as click-stream data. Meeting these changing business requirements demands that the right information be available at the right time.","description":"To stay competitive today, companies must find practical ways to deal with big data — that is, to learn new ways to capture and analyze growing amounts of information about customers, products, and services.\r\n\r\nData is becoming increasingly complex in structured and unstructured ways. New sources of data come from machines, such as sensors; social business sites; and website interaction, such as click-stream data. Meeting these changing business requirements demands that the right information be available at the right time.","blurb":"","authors":[{"authorId":9411,"name":"Judith S. Hurwitz","slug":"judith-hurwitz","description":"Judith Hurwitz is president and CEO of Hurwitz & Associates, specializing in cloud computing, service management, information management, and business strategy.","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9411"}},{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168984,"title":"Ten Hot Big Data Trends","slug":"ten-hot-big-data-trends","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168984"}}],"fromCategory":[{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":281637,"slug":"big-data-for-dummies","isbn":"9781118504222","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1118504224-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/big-data-for-dummies-cover-9781118504222-203x255.jpg","width":203,"height":255},"title":"Big Data For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"34961\">Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy. Alan Nugent has extensive experience in cloud-based big data solutions. Dr. Fern Halper specializes in big data and analytics. Marcia Kaufman specializes in cloud infrastructure, information management, and analytics.</p>","authors":[{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":34961,"name":"Judith S. Hurwitz","slug":"judith-s-hurwitz","description":" <p><b>Daniel Kirsch,</b> Managing Director of Hurwitz &#38; Associates, is a thought leader, researcher, author, and consultant in cloud, AI, and security. <b>Judith Hurwitz,</b> President of Hurwitz &#38; Associates, is a consultant, thought leader, and coauthor of 10 books including <i>Augmented Intelligence, Cognitive Computing and Big Data Analytics,</i> and <i>Hybrid Cloud for Dummies</i> ","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/34961"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-63221b19ad645\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-63221b19ae0eb\"></div></div>"},"articleType":{"articleType":"Cheat Sheet","articleList":[{"articleId":167843,"title":"Defining Big Data: Volume, Velocity, and Variety","slug":"defining-big-data-volume-velocity-and-variety","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/167843"}},{"articleId":167841,"title":"Understanding Unstructured Data","slug":"understanding-unstructured-data","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/167841"}},{"articleId":167840,"title":"The Role of Traditional Operational Data in the Big Data Environment","slug":"the-role-of-traditional-operational-data-in-the-big-data-environment","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/167840"}},{"articleId":167839,"title":"Basics of Big Data Infrastructure","slug":"basics-of-big-data-infrastructure","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/167839"}},{"articleId":167835,"title":"Managing Big Data with Hadoop: HDFS and MapReduce","slug":"managing-big-data-with-hadoop-hdfs-and-mapreduce","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/167835"}},{"articleId":167834,"title":"Laying the Groundwork for Your Big Data Strategy","slug":"laying-the-groundwork-for-your-big-data-strategy","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/167834"}}],"content":[{"title":"Defining big data: Volume, Velocity, and Variety","thumb":null,"image":null,"content":"<p>Big data enables organizations to store, manage, and manipulate vast amounts of disparate data at the right speed and at the right time. To gain the right insights, big data is typically broken down by three characteristics:</p>\n<ul class=\"level-one\">\n<li>\n<p class=\"first-para\"><b>Volume:</b> How much data</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Velocity:</b> How fast data is processed</p>\n</li>\n<li>\n<p class=\"first-para\"><b>Variety:</b> The various types of data</p>\n</li>\n</ul>\n<p>While it is convenient to simplify big data into the three Vs, it can be misleading and overly simplistic. For example, you may be managing a relatively small amount of very disparate, complex data or you may be processing a huge volume of very simple data. That simple data may be all structured or all unstructured.</p>\n<p class=\"Remember\">Even more important is the fourth V, <i>veracity.</i> How accurate is that data in predicting business value? Do the results of a big data analysis actually make sense? Data must be able to be verified based on both accuracy and context. An innovative business may want to be able to analyze massive amounts of data in real time to quickly assess the value of that customer and the potential to provide additional offers to that customer. It is necessary to identify the right amount and types of data that can be analyzed in real time to impact business outcomes.</p>\n<p>Big data incorporates all the varieties of data, including structured data and unstructured data from e-mails, social media, text streams, and so on. This kind of data management requires companies to leverage both their structured and unstructured data.</p>\n"},{"title":"Understanding unstructured data","thumb":null,"image":null,"content":"<p>Unstructured data is different than structured data in that its structure is unpredictable. Examples of unstructured data include documents, e-mails, blogs, digital images, videos, and satellite imagery. It also includes some data generated by machines or sensors. In fact, unstructured data accounts for the majority of data that&#8217;s on your company&#8217;s premises as well as external to your company in online private and public sources such as Twitter and Facebook.</p>\n<p>In the past, most companies weren&#8217;t able to either capture or store this vast amount of data. It was simply too expensive or too overwhelming. Even if companies were able to capture the data, they didn&#8217;t have the tools to easily analyze the data and use the results to make decisions. Very few tools could make sense of these vast amounts of data. The tools that did exist were complex to use and did not produce results in a reasonable time frame.</p>\n<p>In the end, those who really wanted to go to the enormous effort of analyzing this data were forced to work with snapshots of data. This has the undesirable effect of missing important events because they were not in a particular snapshot.</p>\n<p class=\"Remember\">One approach that is becoming increasingly valued as a way to gain business value from unstructured data is <i>text analytics,</i> the process of analyzing unstructured text, extracting relevant information, and transforming it into structured information that can then be leveraged in various ways. The analysis and extraction processes take advantage of techniques that originated in computational linguistics, statistics, and other computer science disciplines.</p>\n"},{"title":"The role of traditional operational data in the big data environment","thumb":null,"image":null,"content":"<p>Knowing what data is stored and where it is stored are critical building blocks in your big data implementation. It&#8217;s unlikely that you&#8217;ll use RDBMSs for the core of the implementation, but it&#8217;s very likely that you&#8217;ll need to rely on the data stored in RDBMSs to create the highest level of value to the business with big data.</p>\n<p class=\"TechnicalStuff\">Most large and small companies probably store most of their important operational information in relational database management systems (RDBMSs), which are built on one or more relations and represented by tables. These tables are defined by the way the data is stored.The data is stored in database objects called tables — organized in rows and columns. RDBMSs follow a consistent approach in the way that data is stored and retrieved.</p>\n<p>To get the most business value from your real-time analysis of unstructured data, you need to understand that data in context with your historical data on customers, products, transactions, and operations. In other words, you will need to integrate your unstructured data with your traditional operational data.</p>\n"},{"title":"Basics of big data infrastructure","thumb":null,"image":null,"content":"<p>Big data is all about high velocity, large volumes, and wide data variety, so the physical infrastructure will literally &#8220;make or break&#8221; the implementation. Most big data implementations need to be highly available, so the networks, servers, and physical storage must be resilient and redundant.</p>\n<p class=\"Remember\">Resiliency and redundancy are interrelated. An infrastructure, or a system, is resilient to failure or changes when sufficient redundant resources are in place ready to jump into action. Resiliency helps to eliminate single points of failure in your infrastructure. For example, if only one network connection exists between your business and the Internet, you have no network redundancy, and the infrastructure is not resilient with respect to a network outage.</p>\n<p>In large data centers with business continuity requirements, most of the redundancy is in place and can be leveraged to create a big data environment. In new implementations, the designers have the responsibility to map the deployment to the needs of the business based on costs and performance.</p>\n"},{"title":"Managing big data with Hadoop: HDFS and MapReduce","thumb":null,"image":null,"content":"<p>Hadoop, an open-source software framework, uses HDFS (the Hadoop Distributed File System) and MapReduce to analyze big data on clusters of commodity hardware—that is, in a distributed computing environment.</p>\n<p>The Hadoop Distributed File System (HDFS) was developed to allow companies to more easily manage huge volumes of data in a simple and pragmatic way. Hadoop allows big problems to be decomposed into smaller elements so that analysis can be done quickly and cost effectively. HDFS is a versatile, resilient, clustered approach to managing files in a big data environment.</p>\n<p>HDFS is not the final destination for files. Rather it is a data &#8220;service&#8221; that offers a unique set of capabilities needed when data volumes and velocity are high.</p>\n<p>MapReduce is a software framework that enables developers to write programs that can process massive amounts of unstructured data in parallel across a distributed group of processors. MapReduce was designed by Google as a way of efficiently executing a set of functions against a large amount of data in batch mode.</p>\n<p>The &#8220;map&#8221; component distributes the programming problem or tasks across a large number of systems and handles the placement of the tasks in a way that balances the load and manages recovery from failures. After the distributed computation is completed, another function called &#8220;reduce&#8221; aggregates all the elements back together to provide a result. An example of MapReduce usage would be to determine how many pages of a book are written in each of 50 different languages.</p>\n"},{"title":"Laying the groundwork for your big data strategy","thumb":null,"image":null,"content":"<p>Companies are swimming in big data. The problem is that they often don&#8217;t know how to pragmatically use that data to be able to predict the future, execute important business processes, or simply gain new insights. The goal of your big data strategy and plan should be to find a pragmatic way to leverage data for more predictable business outcomes.</p>\n<p>Begin your big data strategy by embarking on a discovery process. You need to get a handle on what data you already have, where it is, who owns and controls it, and how it is currently used. For example, what are the third-party data sources that your company relies on? This process can give you a lot of insights:</p>\n<ul class=\"level-one\">\n<li>\n<p class=\"first-para\">You can determine how many data sources you have and how much overlap exists.</p>\n</li>\n<li>\n<p class=\"first-para\">You can identify gaps exist in knowledge about those data sources.</p>\n</li>\n<li>\n<p class=\"first-para\">You might discover that you have lots of duplicate data in one area of the business and almost no data in another area.</p>\n</li>\n<li>\n<p class=\"first-para\">You might ascertain that you are dependent on third-party data that isn&#8217;t as accurate as it should be.</p>\n</li>\n</ul>\n<p class=\"Remember\">Spend the time you need to do this discovery process because it will be the foundation for your planning and execution of your big data strategy.</p>\n"}],"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":"One year","lifeExpectancySetFrom":"2022-02-09T00:00:00+00:00","dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":207996},{"headers":{"creationTime":"2016-03-26T15:32:00+00:00","modifiedTime":"2016-03-26T15:32:00+00:00","timestamp":"2022-09-14T18:05:35+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Integrate Big Data with the Traditional Data Warehouse","strippedTitle":"integrate big data with the traditional data warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","canonicalUrl":"","seo":{"metaDescription":"While the worlds of big data and the traditional data warehouse will intersect, they are unlikely to merge anytime soon. Think of a data warehouse as a system o","noIndex":0,"noFollow":0},"content":"<p>While the worlds of big data and the traditional data warehouse will intersect, they are unlikely to merge anytime soon. Think of a data warehouse as a system of record for business intelligence, much like a customer relationship management (CRM) or accounting system. These systems are highly structured and optimized for specific purposes. In addition, these systems of record tend to be highly centralized.</p>\n<p>The diagram shows a typical approach to data flows with warehouses and marts:</p>\n<p><img src=\"https://www.dummies.com/wp-content/uploads/362967.image0.jpg\" width=\"400\" height=\"289\" alt=\"image0.jpg\"/></p>\n<p>Organizations will inevitably continue to use data warehouses to manage the type of structured and operational data that characterizes systems of record. These data warehouses will still provide business analysts with the ability to analyze key data, trends, and so on. However, the advent of big data is both challenging the role of the data warehouse and providing a complementary approach.</p>\n<p class=\"Remember\">Think of the relationship between the data warehouse and big data as merging to become a hybrid structure. In this hybrid model, the highly structured optimized operational data remains in the tightly controlled data warehouse, while the data that is highly distributed and subject to change in real time is controlled by a Hadoop-based (or similar NoSQL) infrastructure.</p>\n<p>It's inevitable that operational and structured data will have to interact in the world of big data, where the information sources have not (necessarily) been cleansed or profiled. Increasingly, organizations are understanding that they have a business requirement to be able to combine traditional data warehouses with their historical business data sources with less structured and vetted big data sources. A hybrid approach supporting traditional and big data sources can help to accomplish these business goals.</p>","description":"<p>While the worlds of big data and the traditional data warehouse will intersect, they are unlikely to merge anytime soon. Think of a data warehouse as a system of record for business intelligence, much like a customer relationship management (CRM) or accounting system. These systems are highly structured and optimized for specific purposes. In addition, these systems of record tend to be highly centralized.</p>\n<p>The diagram shows a typical approach to data flows with warehouses and marts:</p>\n<p><img src=\"https://www.dummies.com/wp-content/uploads/362967.image0.jpg\" width=\"400\" height=\"289\" alt=\"image0.jpg\"/></p>\n<p>Organizations will inevitably continue to use data warehouses to manage the type of structured and operational data that characterizes systems of record. These data warehouses will still provide business analysts with the ability to analyze key data, trends, and so on. However, the advent of big data is both challenging the role of the data warehouse and providing a complementary approach.</p>\n<p class=\"Remember\">Think of the relationship between the data warehouse and big data as merging to become a hybrid structure. In this hybrid model, the highly structured optimized operational data remains in the tightly controlled data warehouse, while the data that is highly distributed and subject to change in real time is controlled by a Hadoop-based (or similar NoSQL) infrastructure.</p>\n<p>It's inevitable that operational and structured data will have to interact in the world of big data, where the information sources have not (necessarily) been cleansed or profiled. Increasingly, organizations are understanding that they have a business requirement to be able to combine traditional data warehouses with their historical business data sources with less structured and vetted big data sources. A hybrid approach supporting traditional and big data sources can help to accomplish these business goals.</p>","blurb":"","authors":[{"authorId":9411,"name":"Judith S. Hurwitz","slug":"judith-hurwitz","description":"Judith Hurwitz is president and CEO of Hurwitz & Associates, specializing in cloud computing, service management, information management, and business strategy.","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9411"}},{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168984,"title":"Ten Hot Big Data Trends","slug":"ten-hot-big-data-trends","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168984"}}],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":281637,"slug":"big-data-for-dummies","isbn":"9781118504222","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1118504224-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/big-data-for-dummies-cover-9781118504222-203x255.jpg","width":203,"height":255},"title":"Big Data For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"34961\">Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy. Alan Nugent has extensive experience in cloud-based big data solutions. Dr. Fern Halper specializes in big data and analytics. Marcia Kaufman specializes in cloud infrastructure, information management, and analytics.</p>","authors":[{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":34961,"name":"Judith S. Hurwitz","slug":"judith-s-hurwitz","description":" <p><b>Daniel Kirsch,</b> Managing Director of Hurwitz &#38; Associates, is a thought leader, researcher, author, and consultant in cloud, AI, and security. <b>Judith Hurwitz,</b> President of Hurwitz &#38; Associates, is a consultant, thought leader, and coauthor of 10 books including <i>Augmented Intelligence, Cognitive Computing and Big Data Analytics,</i> and <i>Hybrid Cloud for Dummies</i> ","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/34961"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217efae5d1\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217efaee69\"></div></div>"},"articleType":{"articleType":"Articles","articleList":null,"content":null,"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":null,"lifeExpectancySetFrom":null,"dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":168988},{"headers":{"creationTime":"2016-03-26T15:31:59+00:00","modifiedTime":"2016-03-26T15:31:59+00:00","timestamp":"2022-09-14T18:05:35+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Best Practices for Big Data Integration","strippedTitle":"best practices for big data integration","slug":"best-practices-for-big-data-integration","canonicalUrl":"","seo":{"metaDescription":"Many companies are exploring big data problems and coming up with some innovative solutions. Now is the time to pay attention to some best practices, or basic p","noIndex":0,"noFollow":0},"content":"<p>Many companies are exploring big data problems and coming up with some innovative solutions. Now is the time to pay attention to some <i>best practices,</i> or basic principles, that will serve you well as you begin your big data journey.</p>\n<p class=\"Remember\">In reality, big data integration fits into the overall process of integration of data across your company. Therefore, you can't simply toss aside everything you have learned from data integration of traditional data sources. The same rules apply whether you are thinking about traditional data management or big data management. </p>\n<p>Keep these key issues at the top of your priority list for big data integration:</p>\n<ul class=\"level-one\">\n <li><p class=\"first-para\"><b>Keep data quality in perspective. </b>Your emphasis on<b> </b>data quality depends on the stage of your big data analysis. Don't expect to be able to control data quality when you do your initial analysis on huge volumes of data. However, when you narrow down your big data to identify a subset that is most meaningful to your organization, this is when you need to focus on data quality. </p>\n<p class=\"child-para\">Ultimately, data quality becomes important if you want your results to be understood n context with your historical data. As your company relies more and more on analytics as a key planning tool, data quality can mean the difference between success and failure.</p>\n </li>\n <li><p class=\"first-para\"><b>Consider real-time data requirements.</b> Big data will bring streaming data to the forefront. Therefore, you will have to have a clear understanding of how you integrate data in motion into your environment for predictable analysis.</p>\n </li>\n <li><p class=\"first-para\"><b>Don't create new silos of information. </b>While so much of the emphasis around big data is focused on Hadoop and other unstructured and semi-structured sources, remember that you have to manage this data in context with the business. You will therefore need to integrate these sources with your line of business data and your data warehouse. </p>\n </li>\n</ul>","description":"<p>Many companies are exploring big data problems and coming up with some innovative solutions. Now is the time to pay attention to some <i>best practices,</i> or basic principles, that will serve you well as you begin your big data journey.</p>\n<p class=\"Remember\">In reality, big data integration fits into the overall process of integration of data across your company. Therefore, you can't simply toss aside everything you have learned from data integration of traditional data sources. The same rules apply whether you are thinking about traditional data management or big data management. </p>\n<p>Keep these key issues at the top of your priority list for big data integration:</p>\n<ul class=\"level-one\">\n <li><p class=\"first-para\"><b>Keep data quality in perspective. </b>Your emphasis on<b> </b>data quality depends on the stage of your big data analysis. Don't expect to be able to control data quality when you do your initial analysis on huge volumes of data. However, when you narrow down your big data to identify a subset that is most meaningful to your organization, this is when you need to focus on data quality. </p>\n<p class=\"child-para\">Ultimately, data quality becomes important if you want your results to be understood n context with your historical data. As your company relies more and more on analytics as a key planning tool, data quality can mean the difference between success and failure.</p>\n </li>\n <li><p class=\"first-para\"><b>Consider real-time data requirements.</b> Big data will bring streaming data to the forefront. Therefore, you will have to have a clear understanding of how you integrate data in motion into your environment for predictable analysis.</p>\n </li>\n <li><p class=\"first-para\"><b>Don't create new silos of information. </b>While so much of the emphasis around big data is focused on Hadoop and other unstructured and semi-structured sources, remember that you have to manage this data in context with the business. You will therefore need to integrate these sources with your line of business data and your data warehouse. </p>\n </li>\n</ul>","blurb":"","authors":[{"authorId":9411,"name":"Judith S. Hurwitz","slug":"judith-hurwitz","description":"Judith Hurwitz is president and CEO of Hurwitz & Associates, specializing in cloud computing, service management, information management, and business strategy.","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9411"}},{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}},{"articleId":168984,"title":"Ten Hot Big Data Trends","slug":"ten-hot-big-data-trends","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168984"}}],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":281637,"slug":"big-data-for-dummies","isbn":"9781118504222","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1118504224-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/big-data-for-dummies-cover-9781118504222-203x255.jpg","width":203,"height":255},"title":"Big Data For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"34961\">Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy. Alan Nugent has extensive experience in cloud-based big data solutions. Dr. Fern Halper specializes in big data and analytics. Marcia Kaufman specializes in cloud infrastructure, information management, and analytics.</p>","authors":[{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":34961,"name":"Judith S. Hurwitz","slug":"judith-s-hurwitz","description":" <p><b>Daniel Kirsch,</b> Managing Director of Hurwitz &#38; Associates, is a thought leader, researcher, author, and consultant in cloud, AI, and security. <b>Judith Hurwitz,</b> President of Hurwitz &#38; Associates, is a consultant, thought leader, and coauthor of 10 books including <i>Augmented Intelligence, Cognitive Computing and Big Data Analytics,</i> and <i>Hybrid Cloud for Dummies</i> ","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/34961"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217ef9e8c2\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217ef9f133\"></div></div>"},"articleType":{"articleType":"Articles","articleList":null,"content":null,"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":null,"lifeExpectancySetFrom":null,"dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":168987},{"headers":{"creationTime":"2016-03-26T15:31:59+00:00","modifiedTime":"2016-03-26T15:31:59+00:00","timestamp":"2022-09-14T18:05:35+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Big Data Planning Stages","strippedTitle":"big data planning stages","slug":"big-data-planning-stages","canonicalUrl":"","seo":{"metaDescription":"Four stages are part of the planning process that applies to big data. As more businesses begin to use the cloud as a way to deploy new and innovative services ","noIndex":0,"noFollow":0},"content":"<p>Four stages are part of the planning process that applies to big data. As more businesses begin to use the cloud as a way to deploy new and innovative services to customers, the role of data analysis will explode. Therefore, consider another part of your planning process and add three more stages to your data cycle.</p>\n<ul class=\"level-one\">\n <li><p class=\"first-para\"><b>Stage 1: Planning with data: </b>The only way to make sure that business leaders are taking a balanced perspective on all the elements of the business is to have a clear understanding of how data sources are related. The business needs a road map for determining what data is needed to plan for new strategies and new directions.</p>\n </li>\n <li><p class=\"first-para\"><b>Stage 2: Doing the analysis:</b> Executing on big data analysis requires learning a set of new tools and new skills. Many organizations will need to hire some big data scientists who can understand how to take this massive amount of disparate data and begin to understand how all the data elements relate in the context of the business problem or opportunity.</p>\n </li>\n <li><p class=\"first-para\"><b>Stage 3: Checking the results: </b>Make sure you aren’t relying on data sources that will take you in the wrong direction. Many companies will use third-party data sources and may not take the time to vet the quality of the data, but you have to make sure that you are on a strong foundation.</p>\n </li>\n <li><p class=\"first-para\"><b>Stage 4: Acting on the plan:</b> Each time a business initiates a new strategy, it is critical to constantly create a big data business evaluation cycle. This approach of acting based on results of big data analytics and then testing the results of executing business strategy is the key to success. </p>\n </li>\n <li><p class=\"first-para\"><b>Stage 5: Monitoring in real time:</b> Big data analytics enables you to monitor data in near real time proactively. This can have a profound impact on your business. If you are a pharmaceutical company conducting a clinical trial, you may be able to adjust or cancel a trial to avoid a lawsuit. </p>\n </li>\n <li><p class=\"first-para\"><b>Stage 6: Adjusting the impact: </b>When your company has the tools to monitor continuously, it is possible to adjust processes and strategy based on data analytics. Being able to monitor quickly means that a process can be changed earlier and result in better overall quality.</p>\n </li>\n <li><p class=\"first-para\"><b>Stage 7: Enabling experimentation:</b> Combining experimentation with real-time monitoring and rapid adjustment can transform a business strategy. You have less risk with experimentation because you can change directions and outcomes more easily if you are armed with the right data.</p>\n </li>\n</ul>\n<p class=\"Remember\">The greatest challenge for the business is to be able to look into the future and anticipate what might change and why. Companies want to be able to make informed decisions in a faster and more efficient manner. The business wants to apply that knowledge to take action that can change business outcomes. Leaders also need to understand the nuances of the business impacts that are across product lines and their partner ecosystem. The best businesses take a holistic approach to data.</p>","description":"<p>Four stages are part of the planning process that applies to big data. As more businesses begin to use the cloud as a way to deploy new and innovative services to customers, the role of data analysis will explode. Therefore, consider another part of your planning process and add three more stages to your data cycle.</p>\n<ul class=\"level-one\">\n <li><p class=\"first-para\"><b>Stage 1: Planning with data: </b>The only way to make sure that business leaders are taking a balanced perspective on all the elements of the business is to have a clear understanding of how data sources are related. The business needs a road map for determining what data is needed to plan for new strategies and new directions.</p>\n </li>\n <li><p class=\"first-para\"><b>Stage 2: Doing the analysis:</b> Executing on big data analysis requires learning a set of new tools and new skills. Many organizations will need to hire some big data scientists who can understand how to take this massive amount of disparate data and begin to understand how all the data elements relate in the context of the business problem or opportunity.</p>\n </li>\n <li><p class=\"first-para\"><b>Stage 3: Checking the results: </b>Make sure you aren’t relying on data sources that will take you in the wrong direction. Many companies will use third-party data sources and may not take the time to vet the quality of the data, but you have to make sure that you are on a strong foundation.</p>\n </li>\n <li><p class=\"first-para\"><b>Stage 4: Acting on the plan:</b> Each time a business initiates a new strategy, it is critical to constantly create a big data business evaluation cycle. This approach of acting based on results of big data analytics and then testing the results of executing business strategy is the key to success. </p>\n </li>\n <li><p class=\"first-para\"><b>Stage 5: Monitoring in real time:</b> Big data analytics enables you to monitor data in near real time proactively. This can have a profound impact on your business. If you are a pharmaceutical company conducting a clinical trial, you may be able to adjust or cancel a trial to avoid a lawsuit. </p>\n </li>\n <li><p class=\"first-para\"><b>Stage 6: Adjusting the impact: </b>When your company has the tools to monitor continuously, it is possible to adjust processes and strategy based on data analytics. Being able to monitor quickly means that a process can be changed earlier and result in better overall quality.</p>\n </li>\n <li><p class=\"first-para\"><b>Stage 7: Enabling experimentation:</b> Combining experimentation with real-time monitoring and rapid adjustment can transform a business strategy. You have less risk with experimentation because you can change directions and outcomes more easily if you are armed with the right data.</p>\n </li>\n</ul>\n<p class=\"Remember\">The greatest challenge for the business is to be able to look into the future and anticipate what might change and why. Companies want to be able to make informed decisions in a faster and more efficient manner. The business wants to apply that knowledge to take action that can change business outcomes. Leaders also need to understand the nuances of the business impacts that are across product lines and their partner ecosystem. The best businesses take a holistic approach to data.</p>","blurb":"","authors":[{"authorId":9411,"name":"Judith S. Hurwitz","slug":"judith-hurwitz","description":"Judith Hurwitz is president and CEO of Hurwitz & Associates, specializing in cloud computing, service management, information management, and business strategy.","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9411"}},{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}},{"articleId":168984,"title":"Ten Hot Big Data Trends","slug":"ten-hot-big-data-trends","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168984"}}],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":281637,"slug":"big-data-for-dummies","isbn":"9781118504222","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1118504224-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/big-data-for-dummies-cover-9781118504222-203x255.jpg","width":203,"height":255},"title":"Big Data For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"34961\">Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy. Alan Nugent has extensive experience in cloud-based big data solutions. Dr. Fern Halper specializes in big data and analytics. Marcia Kaufman specializes in cloud infrastructure, information management, and analytics.</p>","authors":[{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":34961,"name":"Judith S. Hurwitz","slug":"judith-s-hurwitz","description":" <p><b>Daniel Kirsch,</b> Managing Director of Hurwitz &#38; Associates, is a thought leader, researcher, author, and consultant in cloud, AI, and security. <b>Judith Hurwitz,</b> President of Hurwitz &#38; Associates, is a consultant, thought leader, and coauthor of 10 books including <i>Augmented Intelligence, Cognitive Computing and Big Data Analytics,</i> and <i>Hybrid Cloud for Dummies</i> ","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/34961"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217ef96a23\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217ef972c5\"></div></div>"},"articleType":{"articleType":"Articles","articleList":null,"content":null,"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":null,"lifeExpectancySetFrom":null,"dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":168986},{"headers":{"creationTime":"2016-03-26T15:31:59+00:00","modifiedTime":"2016-03-26T15:31:59+00:00","timestamp":"2022-09-14T18:05:35+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"How to Analyze Big Data to Get Results","strippedTitle":"how to analyze big data to get results","slug":"how-to-analyze-big-data-to-get-results","canonicalUrl":"","seo":{"metaDescription":"Big data is most useful if you can do something with it, but how do you analyze it? Companies like Amazon and Google are masters at analyzing big data. And they","noIndex":0,"noFollow":0},"content":"<p>Big data is most useful if you can do something with it, but how do you analyze it? Companies like Amazon and Google are masters at analyzing big data. And they use the resulting knowledge to gain a competitive advantage.</p>\n<p>Just think about Amazon's recommendation engine. The company takes all your buying history together with what it knows about you, your buying patterns, and the buying patterns of people like you to come up with some pretty good suggestions. It's a marketing machine, and its big data analytics capabilities have made it extremely successful.</p>\n<p class=\"Remember\">The ability to analyze big data provides unique opportunities for your organization as well. You'll be able to expand the kind of analysis you can do. Instead of being limited to sampling large data sets, you can now use much more detailed and complete data to do your analysis. However, analyzing big data can also be challenging. Changing algorithms and technology, even for basic data analysis, often has to be addressed with big data.</p>\n<p>The first question that you need to ask yourself before you dive into big data analysis is what problem are you trying to solve? You may not even be sure of what you are looking for. You know you have lots of data that you think you can get valuable insight from. And certainly, patterns can emerge from that data before you understand why they are there.</p>\n<p>If you think about it though, you're sure to have an idea of what you're interested in. For instance, are you interested in predicting customer behavior to prevent churn? Do you want to analyze the driving patterns of your customers for insurance premium purposes? Are you interested in looking at your system log data to ultimately predict when problems might occur? The kind of high-level problem is going to drive the analytics you decide to use. </p>\n<p>Alternately, if you're not exactly sure of the business problem you're trying to solve, maybe you need to look at areas in your business that need improvement. Even an analytics-driven strategy — targeted at the right area — can provide useful results with big data. When it comes to analytics, you might consider a range of possible kinds, which are briefly outlined in the table.</p>\n<table>\n<tr>\n<th>Analysis Type</th>\n<th>Description</th>\n</tr>\n<tr>\n<td>Basic analytics for insight</td>\n<td>Slicing and dicing of data, reporting, simple visualizations,\nbasic monitoring.</td>\n</tr>\n<tr>\n<td>Advanced analytics for insight</td>\n<td>More complex analysis such as predictive modeling and other\npattern-matching techniques.</td>\n</tr>\n<tr>\n<td>Operationalized analytics</td>\n<td>Analytics become part of the business process.</td>\n</tr>\n<tr>\n<td>Monetized analytics</td>\n<td>Analytics are utilized to directly drive revenue.</td>\n</tr>\n</table>","description":"<p>Big data is most useful if you can do something with it, but how do you analyze it? Companies like Amazon and Google are masters at analyzing big data. And they use the resulting knowledge to gain a competitive advantage.</p>\n<p>Just think about Amazon's recommendation engine. The company takes all your buying history together with what it knows about you, your buying patterns, and the buying patterns of people like you to come up with some pretty good suggestions. It's a marketing machine, and its big data analytics capabilities have made it extremely successful.</p>\n<p class=\"Remember\">The ability to analyze big data provides unique opportunities for your organization as well. You'll be able to expand the kind of analysis you can do. Instead of being limited to sampling large data sets, you can now use much more detailed and complete data to do your analysis. However, analyzing big data can also be challenging. Changing algorithms and technology, even for basic data analysis, often has to be addressed with big data.</p>\n<p>The first question that you need to ask yourself before you dive into big data analysis is what problem are you trying to solve? You may not even be sure of what you are looking for. You know you have lots of data that you think you can get valuable insight from. And certainly, patterns can emerge from that data before you understand why they are there.</p>\n<p>If you think about it though, you're sure to have an idea of what you're interested in. For instance, are you interested in predicting customer behavior to prevent churn? Do you want to analyze the driving patterns of your customers for insurance premium purposes? Are you interested in looking at your system log data to ultimately predict when problems might occur? The kind of high-level problem is going to drive the analytics you decide to use. </p>\n<p>Alternately, if you're not exactly sure of the business problem you're trying to solve, maybe you need to look at areas in your business that need improvement. Even an analytics-driven strategy — targeted at the right area — can provide useful results with big data. When it comes to analytics, you might consider a range of possible kinds, which are briefly outlined in the table.</p>\n<table>\n<tr>\n<th>Analysis Type</th>\n<th>Description</th>\n</tr>\n<tr>\n<td>Basic analytics for insight</td>\n<td>Slicing and dicing of data, reporting, simple visualizations,\nbasic monitoring.</td>\n</tr>\n<tr>\n<td>Advanced analytics for insight</td>\n<td>More complex analysis such as predictive modeling and other\npattern-matching techniques.</td>\n</tr>\n<tr>\n<td>Operationalized analytics</td>\n<td>Analytics become part of the business process.</td>\n</tr>\n<tr>\n<td>Monetized analytics</td>\n<td>Analytics are utilized to directly drive revenue.</td>\n</tr>\n</table>","blurb":"","authors":[{"authorId":9411,"name":"Judith S. Hurwitz","slug":"judith-hurwitz","description":"Judith Hurwitz is president and CEO of Hurwitz & Associates, specializing in cloud computing, service management, information management, and business strategy.","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9411"}},{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}},{"articleId":168984,"title":"Ten Hot Big Data Trends","slug":"ten-hot-big-data-trends","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168984"}}],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":281637,"slug":"big-data-for-dummies","isbn":"9781118504222","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1118504224-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/big-data-for-dummies-cover-9781118504222-203x255.jpg","width":203,"height":255},"title":"Big Data For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"34961\">Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy. Alan Nugent has extensive experience in cloud-based big data solutions. Dr. Fern Halper specializes in big data and analytics. Marcia Kaufman specializes in cloud infrastructure, information management, and analytics.</p>","authors":[{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":34961,"name":"Judith S. Hurwitz","slug":"judith-s-hurwitz","description":" <p><b>Daniel Kirsch,</b> Managing Director of Hurwitz &#38; Associates, is a thought leader, researcher, author, and consultant in cloud, AI, and security. <b>Judith Hurwitz,</b> President of Hurwitz &#38; Associates, is a consultant, thought leader, and coauthor of 10 books including <i>Augmented Intelligence, Cognitive Computing and Big Data Analytics,</i> and <i>Hybrid Cloud for Dummies</i> ","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/34961"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217efa689a\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217efa7111\"></div></div>"},"articleType":{"articleType":"Articles","articleList":null,"content":null,"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":null,"lifeExpectancySetFrom":null,"dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":168985},{"headers":{"creationTime":"2016-03-26T15:31:58+00:00","modifiedTime":"2016-03-26T15:31:58+00:00","timestamp":"2022-09-14T18:05:35+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Ten Hot Big Data Trends","strippedTitle":"ten hot big data trends","slug":"ten-hot-big-data-trends","canonicalUrl":"","seo":{"metaDescription":"As you enter the world of big data, you'll need to absorb many new types of database and data-management technologies. Here are the top-ten big data trends: Had","noIndex":0,"noFollow":0},"content":"<p>As you enter the world of big data, you'll need to absorb many new types of database and data-management technologies. Here are the top-ten big data trends:</p>\n<ul class=\"level-one\">\n <li><p class=\"first-para\"><b>Hadoop is becoming the underpinning for distributed big data management. </b>Hadoop is a distributed file system that can be used in conjunction with MapReduce to process and analyze massive amounts of data, enabling the big data trend. Hadoop will be tightly integrated into data warehousing technologies so that structured and unstructured data can be integrated more effectively.</p>\n </li>\n <li><p class=\"first-para\"><b>Big data makes it possible to leverage data from sensors to change business outcomes. </b>More and more businesses are using highly sophisticated sensors on the equipment that runs their operations. New innovations in big data technology are making it possible to analyze all this data to get advanced notification of problems that can be fixed to protect the business.</p>\n </li>\n <li><p class=\"first-para\"><b>Big data can help a business initiative become a real-time action to increase revenue. </b>Companies in markets such as retail are using real-time streaming data analytics to keep track of customer actions and offer incentives to increase revenue per customer.</p>\n </li>\n <li><p class=\"first-para\"><b>Big data can be integrated with historical data warehouses to transform planning.</b> Big data can provide a company with a better understanding of massive amounts of data about their business. This information about the current state of the business can be combined with historical data to get a full view of the context for business change.</p>\n </li>\n <li><p class=\"first-para\"><b>Big data can change the way diseases are managed by adding predictive analytics.</b> Increasingly, healthcare practitioners are looking to big data solutions to gain insights into disease by compare symptoms and test results to databases of results from hundreds of thousands of other cases. This allows practitioners to more quickly predict outcomes and save lives.</p>\n </li>\n <li><p class=\"first-para\"><b>Cloud computing will transform the way that data will be managed in the future.</b> Cloud computing is invaluable as a tool to support the expansion of big data. Increasingly, cloud services that are optimized for data will mean that many more services and delivery models will make big data more practical for companies of all sizes.</p>\n </li>\n <li><p class=\"first-para\"><b>Security and governance will be the difference between success and failure of businesses leveraging big data. </b>Big data can be a huge benefit, but it isn't risk-free. Companies will discover that if they are not careful, it is possible to expose private information through big data analysis. Companies need to balance the need to analyze results with best practices for security and governance.</p>\n </li>\n <li><p class=\"first-para\"><b>Veracity, or truthfulness, of big data will become the most important issue for the coming year.</b> Many companies can get carried away with the ability to analyze massive amounts of data and get back compelling results that predict business outcomes. Therefore, companies will find that the truthfulness of the data must become a top priority or decision making will suffer.</p>\n </li>\n <li><p class=\"first-para\"><b>As big data moves out of the experimental stage, more packaged offerings will be developed. </b>Most big data projects initiated over the past few years have been experimental. Companies are cautiously working with new tools and technology. Now big data is about to enter the mainstream. Lots of packaged big data offerings will flood the market.</p>\n </li>\n <li><p class=\"first-para\"><b>Use cases and new innovative ways to apply big data will explode.</b> Early successes with big data in different industries such as manufacturing, retail, and healthcare will lead to many more industries looking at ways to leverage massive amounts of data to transform their industries.</p>\n </li>\n</ul>","description":"<p>As you enter the world of big data, you'll need to absorb many new types of database and data-management technologies. Here are the top-ten big data trends:</p>\n<ul class=\"level-one\">\n <li><p class=\"first-para\"><b>Hadoop is becoming the underpinning for distributed big data management. </b>Hadoop is a distributed file system that can be used in conjunction with MapReduce to process and analyze massive amounts of data, enabling the big data trend. Hadoop will be tightly integrated into data warehousing technologies so that structured and unstructured data can be integrated more effectively.</p>\n </li>\n <li><p class=\"first-para\"><b>Big data makes it possible to leverage data from sensors to change business outcomes. </b>More and more businesses are using highly sophisticated sensors on the equipment that runs their operations. New innovations in big data technology are making it possible to analyze all this data to get advanced notification of problems that can be fixed to protect the business.</p>\n </li>\n <li><p class=\"first-para\"><b>Big data can help a business initiative become a real-time action to increase revenue. </b>Companies in markets such as retail are using real-time streaming data analytics to keep track of customer actions and offer incentives to increase revenue per customer.</p>\n </li>\n <li><p class=\"first-para\"><b>Big data can be integrated with historical data warehouses to transform planning.</b> Big data can provide a company with a better understanding of massive amounts of data about their business. This information about the current state of the business can be combined with historical data to get a full view of the context for business change.</p>\n </li>\n <li><p class=\"first-para\"><b>Big data can change the way diseases are managed by adding predictive analytics.</b> Increasingly, healthcare practitioners are looking to big data solutions to gain insights into disease by compare symptoms and test results to databases of results from hundreds of thousands of other cases. This allows practitioners to more quickly predict outcomes and save lives.</p>\n </li>\n <li><p class=\"first-para\"><b>Cloud computing will transform the way that data will be managed in the future.</b> Cloud computing is invaluable as a tool to support the expansion of big data. Increasingly, cloud services that are optimized for data will mean that many more services and delivery models will make big data more practical for companies of all sizes.</p>\n </li>\n <li><p class=\"first-para\"><b>Security and governance will be the difference between success and failure of businesses leveraging big data. </b>Big data can be a huge benefit, but it isn't risk-free. Companies will discover that if they are not careful, it is possible to expose private information through big data analysis. Companies need to balance the need to analyze results with best practices for security and governance.</p>\n </li>\n <li><p class=\"first-para\"><b>Veracity, or truthfulness, of big data will become the most important issue for the coming year.</b> Many companies can get carried away with the ability to analyze massive amounts of data and get back compelling results that predict business outcomes. Therefore, companies will find that the truthfulness of the data must become a top priority or decision making will suffer.</p>\n </li>\n <li><p class=\"first-para\"><b>As big data moves out of the experimental stage, more packaged offerings will be developed. </b>Most big data projects initiated over the past few years have been experimental. Companies are cautiously working with new tools and technology. Now big data is about to enter the mainstream. Lots of packaged big data offerings will flood the market.</p>\n </li>\n <li><p class=\"first-para\"><b>Use cases and new innovative ways to apply big data will explode.</b> Early successes with big data in different industries such as manufacturing, retail, and healthcare will lead to many more industries looking at ways to leverage massive amounts of data to transform their industries.</p>\n </li>\n</ul>","blurb":"","authors":[{"authorId":9411,"name":"Judith S. Hurwitz","slug":"judith-hurwitz","description":"Judith Hurwitz is president and CEO of Hurwitz & Associates, specializing in cloud computing, service management, information management, and business strategy.","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9411"}},{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}}],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":281637,"slug":"big-data-for-dummies","isbn":"9781118504222","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1118504224-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/big-data-for-dummies-cover-9781118504222-203x255.jpg","width":203,"height":255},"title":"Big Data For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"34961\">Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy. Alan Nugent has extensive experience in cloud-based big data solutions. Dr. Fern Halper specializes in big data and analytics. Marcia Kaufman specializes in cloud infrastructure, information management, and analytics.</p>","authors":[{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":34961,"name":"Judith S. Hurwitz","slug":"judith-s-hurwitz","description":" <p><b>Daniel Kirsch,</b> Managing Director of Hurwitz &#38; Associates, is a thought leader, researcher, author, and consultant in cloud, AI, and security. <b>Judith Hurwitz,</b> President of Hurwitz &#38; Associates, is a consultant, thought leader, and coauthor of 10 books including <i>Augmented Intelligence, Cognitive Computing and Big Data Analytics,</i> and <i>Hybrid Cloud for Dummies</i> ","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/34961"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217ef8e625\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217ef8eedb\"></div></div>"},"articleType":{"articleType":"Articles","articleList":null,"content":null,"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":null,"lifeExpectancySetFrom":null,"dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":168984},{"headers":{"creationTime":"2016-03-26T15:31:57+00:00","modifiedTime":"2016-03-26T15:31:57+00:00","timestamp":"2022-09-14T18:05:35+00:00"},"data":{"breadcrumbs":[{"name":"Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33512"},"slug":"technology","categoryId":33512},{"name":"Information Technology","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33572"},"slug":"information-technology","categoryId":33572},{"name":"Data Science","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33577"},"slug":"data-science","categoryId":33577},{"name":"Big Data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"},"slug":"big-data","categoryId":33578}],"title":"Explore the Big Data Stack","strippedTitle":"explore the big data stack","slug":"explore-the-big-data-stack","canonicalUrl":"","seo":{"metaDescription":"To understand big data, it helps to see how it stacks up — that is, to lay out the components of the architecture. A big data management architecture must inclu","noIndex":0,"noFollow":0},"content":"<p>To understand big data, it helps to see how it stacks up — that is, to lay out the components of the architecture. A big data management architecture must include a variety of services that enable companies to make use of myriad data sources in a fast and effective manner.</p>\n<img src=\"https://www.dummies.com/wp-content/uploads/362970.image0.jpg\" width=\"400\" height=\"262\" alt=\"image0.jpg\"/>\n<p>Here's a closer look at what's in the image and the relationship between the components:</p>\n<ul class=\"level-one\">\n <li><p class=\"first-para\"><b>Interfaces and feeds: </b>On either side of the diagram are indications of interfaces and feeds into and out of both internally managed data and data feeds from external sources. To understand how big data works in the real world, start by understanding this necessity. </p>\n<p class=\"child-para Remember\">What makes big data big is that it relies on picking up lots of data from lots of sources. Therefore, open application programming interfaces (APIs) will be core to any big data architecture.</p>\n<p class=\"child-para\">In addition, keep in mind that interfaces exist at every level and between every layer of the stack. Without integration services, big data can't happen.</p>\n </li>\n <li><p class=\"first-para\"><b>Redundant physical infrastructure: </b>The supporting physical infrastructure is fundamental to the operation and scalability of a big data architecture. Without the availability of robust physical infrastructures, big data would probably not have emerged as such an important trend. </p>\n<p class=\"child-para\">To support an unanticipated or unpredictable volume of data, a physical infrastructure for big data has to be different than that for traditional data. The physical infrastructure is based on a distributed computing model. This means that data may be physically stored in many different locations and can be linked together through networks, the use of a distributed file system, and various big data analytic tools and applications.</p>\n </li>\n <li><p class=\"first-para\"><b>Security infrastructure:</b> The more important big data analysis becomes to companies, the more important it will be to secure that data. For example, if you are a healthcare company, you will probably want to use big data applications to determine changes in demographics or shifts in patient needs.</p>\n<p class=\"child-para\">This data about your constituents needs to be protected both to meet compliance requirements and to protect the patients' privacy. You will need to take into account who is allowed to see the data and under what circumstances they are allowed to do so. You will need to be able to verify the identity of users as well as protect the identity of patients.</p>\n </li>\n <li><p class=\"first-para\"><b>Operational data sources:</b> When you think about big data, understand that you have to incorporate all the data sources that will give you a complete picture of your business and see how the data impacts the way you operate your business.</p>\n<p class=\"child-para\">Traditionally, an operational data source consisted of highly structured data managed by the line of business in a relational database. But as the world changes, it is important to understand that operational data now has to encompass a broader set of data sources.</p>\n </li>\n</ul>","description":"<p>To understand big data, it helps to see how it stacks up — that is, to lay out the components of the architecture. A big data management architecture must include a variety of services that enable companies to make use of myriad data sources in a fast and effective manner.</p>\n<img src=\"https://www.dummies.com/wp-content/uploads/362970.image0.jpg\" width=\"400\" height=\"262\" alt=\"image0.jpg\"/>\n<p>Here's a closer look at what's in the image and the relationship between the components:</p>\n<ul class=\"level-one\">\n <li><p class=\"first-para\"><b>Interfaces and feeds: </b>On either side of the diagram are indications of interfaces and feeds into and out of both internally managed data and data feeds from external sources. To understand how big data works in the real world, start by understanding this necessity. </p>\n<p class=\"child-para Remember\">What makes big data big is that it relies on picking up lots of data from lots of sources. Therefore, open application programming interfaces (APIs) will be core to any big data architecture.</p>\n<p class=\"child-para\">In addition, keep in mind that interfaces exist at every level and between every layer of the stack. Without integration services, big data can't happen.</p>\n </li>\n <li><p class=\"first-para\"><b>Redundant physical infrastructure: </b>The supporting physical infrastructure is fundamental to the operation and scalability of a big data architecture. Without the availability of robust physical infrastructures, big data would probably not have emerged as such an important trend. </p>\n<p class=\"child-para\">To support an unanticipated or unpredictable volume of data, a physical infrastructure for big data has to be different than that for traditional data. The physical infrastructure is based on a distributed computing model. This means that data may be physically stored in many different locations and can be linked together through networks, the use of a distributed file system, and various big data analytic tools and applications.</p>\n </li>\n <li><p class=\"first-para\"><b>Security infrastructure:</b> The more important big data analysis becomes to companies, the more important it will be to secure that data. For example, if you are a healthcare company, you will probably want to use big data applications to determine changes in demographics or shifts in patient needs.</p>\n<p class=\"child-para\">This data about your constituents needs to be protected both to meet compliance requirements and to protect the patients' privacy. You will need to take into account who is allowed to see the data and under what circumstances they are allowed to do so. You will need to be able to verify the identity of users as well as protect the identity of patients.</p>\n </li>\n <li><p class=\"first-para\"><b>Operational data sources:</b> When you think about big data, understand that you have to incorporate all the data sources that will give you a complete picture of your business and see how the data impacts the way you operate your business.</p>\n<p class=\"child-para\">Traditionally, an operational data source consisted of highly structured data managed by the line of business in a relational database. But as the world changes, it is important to understand that operational data now has to encompass a broader set of data sources.</p>\n </li>\n</ul>","blurb":"","authors":[{"authorId":9411,"name":"Judith S. Hurwitz","slug":"judith-hurwitz","description":"Judith Hurwitz is president and CEO of Hurwitz & Associates, specializing in cloud computing, service management, information management, and business strategy.","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9411"}},{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"primaryCategoryTaxonomy":{"categoryId":33578,"title":"Big Data","slug":"big-data","_links":{"self":"https://dummies-api.dummies.com/v2/categories/33578"}},"secondaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"tertiaryCategoryTaxonomy":{"categoryId":0,"title":null,"slug":null,"_links":null},"trendingArticles":[{"articleId":192609,"title":"How to Pray the Rosary: A Comprehensive Guide","slug":"how-to-pray-the-rosary","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/192609"}},{"articleId":208741,"title":"Kabbalah For Dummies Cheat Sheet","slug":"kabbalah-for-dummies-cheat-sheet","categoryList":["body-mind-spirit","religion-spirituality","kabbalah"],"_links":{"self":"/articles/208741"}},{"articleId":230957,"title":"Nikon D3400 For Dummies Cheat Sheet","slug":"nikon-d3400-dummies-cheat-sheet","categoryList":["home-auto-hobbies","photography"],"_links":{"self":"/articles/230957"}},{"articleId":235851,"title":"Praying the Rosary and Meditating on the Mysteries","slug":"praying-rosary-meditating-mysteries","categoryList":["body-mind-spirit","religion-spirituality","christianity","catholicism"],"_links":{"self":"/articles/235851"}},{"articleId":284787,"title":"What Your Society Says About You","slug":"what-your-society-says-about-you","categoryList":["academics-the-arts","humanities"],"_links":{"self":"/articles/284787"}}],"inThisArticle":[],"relatedArticles":{"fromBook":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168987,"title":"Best Practices for Big Data Integration","slug":"best-practices-for-big-data-integration","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168987"}},{"articleId":168985,"title":"How to Analyze Big Data to Get Results","slug":"how-to-analyze-big-data-to-get-results","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168985"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}}],"fromCategory":[{"articleId":207996,"title":"Big Data For Dummies Cheat Sheet","slug":"big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207996"}},{"articleId":207478,"title":"Statistics for Big Data For Dummies Cheat Sheet","slug":"statistics-for-big-data-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207478"}},{"articleId":207432,"title":"Big Data for Small Business For Dummies Cheat Sheet","slug":"big-data-for-small-business-for-dummies-cheat-sheet","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/207432"}},{"articleId":168988,"title":"Integrate Big Data with the Traditional Data Warehouse","slug":"integrate-big-data-with-the-traditional-data-warehouse","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168988"}},{"articleId":168986,"title":"Big Data Planning Stages","slug":"big-data-planning-stages","categoryList":["technology","information-technology","data-science","big-data"],"_links":{"self":"https://dummies-api.dummies.com/v2/articles/168986"}}]},"hasRelatedBookFromSearch":false,"relatedBook":{"bookId":281637,"slug":"big-data-for-dummies","isbn":"9781118504222","categoryList":["technology","information-technology","data-science","big-data"],"amazon":{"default":"https://www.amazon.com/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","ca":"https://www.amazon.ca/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","indigo_ca":"http://www.tkqlhce.com/click-9208661-13710633?url=https://www.chapters.indigo.ca/en-ca/books/product/1118504224-item.html&cjsku=978111945484","gb":"https://www.amazon.co.uk/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20","de":"https://www.amazon.de/gp/product/1118504224/ref=as_li_tl?ie=UTF8&tag=wiley01-20"},"image":{"src":"https://www.dummies.com/wp-content/uploads/big-data-for-dummies-cover-9781118504222-203x255.jpg","width":203,"height":255},"title":"Big Data For Dummies","testBankPinActivationLink":"","bookOutOfPrint":false,"authorsInfo":"<p><b data-author-id=\"34961\">Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy. Alan Nugent has extensive experience in cloud-based big data solutions. Dr. Fern Halper specializes in big data and analytics. Marcia Kaufman specializes in cloud infrastructure, information management, and analytics.</p>","authors":[{"authorId":9727,"name":"Alan Nugent","slug":"alan-nugent","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9727"}},{"authorId":9413,"name":"Fern Halper","slug":"fern-halper","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9413"}},{"authorId":34961,"name":"Judith S. Hurwitz","slug":"judith-s-hurwitz","description":" <p><b>Daniel Kirsch,</b> Managing Director of Hurwitz &#38; Associates, is a thought leader, researcher, author, and consultant in cloud, AI, and security. <b>Judith Hurwitz,</b> President of Hurwitz &#38; Associates, is a consultant, thought leader, and coauthor of 10 books including <i>Augmented Intelligence, Cognitive Computing and Big Data Analytics,</i> and <i>Hybrid Cloud for Dummies</i> ","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/34961"}},{"authorId":9412,"name":"Marcia Kaufman","slug":"marcia-kaufman","description":" <p><b>Judith Hurwitz</b> is an expert in cloud computing, information management, and business strategy.</p> <p><b>Alan Nugent</b> has extensive experience in cloud-based big data solutions.</p> <p><b>Dr. Fern Halper</b> specializes in big data and analytics.</p> <p><b>Marcia Kaufman</b> specializes in cloud infrastructure, information management, and analytics.</p>","hasArticle":false,"_links":{"self":"https://dummies-api.dummies.com/v2/authors/9412"}}],"_links":{"self":"https://dummies-api.dummies.com/v2/books/"}},"collections":[],"articleAds":{"footerAd":"<div class=\"du-ad-region row\" id=\"article_page_adhesion_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_adhesion_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217ef7b94c\"></div></div>","rightAd":"<div class=\"du-ad-region row\" id=\"article_page_right_ad\"><div class=\"du-ad-unit col-md-12\" data-slot-id=\"article_page_right_ad\" data-refreshed=\"false\" \r\n data-target = \"[{&quot;key&quot;:&quot;cat&quot;,&quot;values&quot;:[&quot;technology&quot;,&quot;information-technology&quot;,&quot;data-science&quot;,&quot;big-data&quot;]},{&quot;key&quot;:&quot;isbn&quot;,&quot;values&quot;:[&quot;9781118504222&quot;]}]\" id=\"du-slot-632217ef7c1c5\"></div></div>"},"articleType":{"articleType":"Articles","articleList":null,"content":null,"videoInfo":{"videoId":null,"name":null,"accountId":null,"playerId":null,"thumbnailUrl":null,"description":null,"uploadDate":null}},"sponsorship":{"sponsorshipPage":false,"backgroundImage":{"src":null,"width":0,"height":0},"brandingLine":"","brandingLink":"","brandingLogo":{"src":null,"width":0,"height":0},"sponsorAd":"","sponsorEbookTitle":"","sponsorEbookLink":"","sponsorEbookImage":{"src":null,"width":0,"height":0}},"primaryLearningPath":"Advance","lifeExpectancy":null,"lifeExpectancySetFrom":null,"dummiesForKids":"no","sponsoredContent":"no","adInfo":"","adPairKey":[]},"status":"publish","visibility":"public","articleId":168983}],"_links":{"self":{"self":"https://dummies-api.dummies.com/v2/categories/33578/categoryArticles?sortField=time&sortOrder=1&size=10&offset=0"},"next":{"self":"https://dummies-api.dummies.com/v2/categories/33578/categoryArticles?sortField=time&sortOrder=1&size=10&offset=10"},"last":{"self":"https://dummies-api.dummies.com/v2/categories/33578/categoryArticles?sortField=time&sortOrder=1&size=10&offset=164"}}},"objectTitle":"","status":"success","pageType":"article-category","objectId":"33578","page":1,"sortField":"time","sortOrder":1,"categoriesIds":[],"articleTypes":[],"filterData":{"categoriesFilter":[{"itemId":0,"itemName":"All Categories","count":174}],"articleTypeFilter":[{"articleType":"All Types","count":174},{"articleType":"Articles","count":171},{"articleType":"Cheat Sheet","count":3}]},"filterDataLoadedStatus":"success","pageSize":10},"adsState":{"pageScripts":{"headers":{"timestamp":"2025-04-17T15:50:01+00:00"},"adsId":0,"data":{"scripts":[{"pages":["all"],"location":"header","script":"<!--Optimizely Script-->\r\n<script src=\"https://cdn.optimizely.com/js/10563184655.js\"></script>","enabled":false},{"pages":["all"],"location":"header","script":"<!-- comScore Tag -->\r\n<script>var _comscore = _comscore || [];_comscore.push({ c1: \"2\", c2: \"15097263\" });(function() {var s = document.createElement(\"script\"), el = document.getElementsByTagName(\"script\")[0]; s.async = true;s.src = (document.location.protocol == \"https:\" ? \"https://sb\" : \"http://b\") + \".scorecardresearch.com/beacon.js\";el.parentNode.insertBefore(s, el);})();</script><noscript><img src=\"https://sb.scorecardresearch.com/p?c1=2&c2=15097263&cv=2.0&cj=1\" /></noscript>\r\n<!-- / comScore Tag -->","enabled":true},{"pages":["all"],"location":"footer","script":"<!--BEGIN QUALTRICS WEBSITE FEEDBACK SNIPPET-->\r\n<script type='text/javascript'>\r\n(function(){var g=function(e,h,f,g){\r\nthis.get=function(a){for(var a=a+\"=\",c=document.cookie.split(\";\"),b=0,e=c.length;b<e;b++){for(var d=c[b];\" \"==d.charAt(0);)d=d.substring(1,d.length);if(0==d.indexOf(a))return d.substring(a.length,d.length)}return null};\r\nthis.set=function(a,c){var b=\"\",b=new Date;b.setTime(b.getTime()+6048E5);b=\"; expires=\"+b.toGMTString();document.cookie=a+\"=\"+c+b+\"; path=/; \"};\r\nthis.check=function(){var a=this.get(f);if(a)a=a.split(\":\");else if(100!=e)\"v\"==h&&(e=Math.random()>=e/100?0:100),a=[h,e,0],this.set(f,a.join(\":\"));else return!0;var c=a[1];if(100==c)return!0;switch(a[0]){case \"v\":return!1;case \"r\":return c=a[2]%Math.floor(100/c),a[2]++,this.set(f,a.join(\":\")),!c}return!0};\r\nthis.go=function(){if(this.check()){var a=document.createElement(\"script\");a.type=\"text/javascript\";a.src=g;document.body&&document.body.appendChild(a)}};\r\nthis.start=function(){var t=this;\"complete\"!==document.readyState?window.addEventListener?window.addEventListener(\"load\",function(){t.go()},!1):window.attachEvent&&window.attachEvent(\"onload\",function(){t.go()}):t.go()};};\r\ntry{(new g(100,\"r\",\"QSI_S_ZN_5o5yqpvMVjgDOuN\",\"https://zn5o5yqpvmvjgdoun-wiley.siteintercept.qualtrics.com/SIE/?Q_ZID=ZN_5o5yqpvMVjgDOuN\")).start()}catch(i){}})();\r\n</script><div id='ZN_5o5yqpvMVjgDOuN'><!--DO NOT REMOVE-CONTENTS PLACED HERE--></div>\r\n<!--END WEBSITE FEEDBACK SNIPPET-->","enabled":false},{"pages":["all"],"location":"header","script":"<!-- Hotjar Tracking Code for http://www.dummies.com -->\r\n<script>\r\n (function(h,o,t,j,a,r){\r\n h.hj=h.hj||function(){(h.hj.q=h.hj.q||[]).push(arguments)};\r\n h._hjSettings={hjid:257151,hjsv:6};\r\n a=o.getElementsByTagName('head')[0];\r\n r=o.createElement('script');r.async=1;\r\n r.src=t+h._hjSettings.hjid+j+h._hjSettings.hjsv;\r\n a.appendChild(r);\r\n })(window,document,'https://static.hotjar.com/c/hotjar-','.js?sv=');\r\n</script>","enabled":false},{"pages":["article"],"location":"header","script":"<!-- //Connect Container: dummies --> <script src=\"//get.s-onetag.com/bffe21a1-6bb8-4928-9449-7beadb468dae/tag.min.js\" async defer></script>","enabled":true},{"pages":["homepage"],"location":"header","script":"<meta name=\"facebook-domain-verification\" content=\"irk8y0irxf718trg3uwwuexg6xpva0\" />","enabled":true},{"pages":["homepage","article","category","search"],"location":"footer","script":"<!-- Facebook Pixel Code -->\r\n<noscript>\r\n<img height=\"1\" width=\"1\" src=\"https://www.facebook.com/tr?id=256338321977984&ev=PageView&noscript=1\"/>\r\n</noscript>\r\n<!-- End Facebook Pixel Code -->","enabled":true}]}},"pageScriptsLoadedStatus":"success"},"navigationState":{"navigationCollections":[{"collectionId":287568,"title":"BYOB (Be Your Own Boss)","hasSubCategories":false,"url":"/collection/for-the-entry-level-entrepreneur-287568"},{"collectionId":293237,"title":"Be a Rad Dad","hasSubCategories":false,"url":"/collection/be-the-best-dad-293237"},{"collectionId":295890,"title":"Career Shifting","hasSubCategories":false,"url":"/collection/career-shifting-295890"},{"collectionId":294090,"title":"Contemplating the Cosmos","hasSubCategories":false,"url":"/collection/theres-something-about-space-294090"},{"collectionId":287563,"title":"For Those Seeking Peace of Mind","hasSubCategories":false,"url":"/collection/for-those-seeking-peace-of-mind-287563"},{"collectionId":287570,"title":"For the Aspiring Aficionado","hasSubCategories":false,"url":"/collection/for-the-bougielicious-287570"},{"collectionId":291903,"title":"For the Budding Cannabis Enthusiast","hasSubCategories":false,"url":"/collection/for-the-budding-cannabis-enthusiast-291903"},{"collectionId":299891,"title":"For the College Bound","hasSubCategories":false,"url":"/collection/for-the-college-bound-299891"},{"collectionId":291934,"title":"For the Exam-Season Crammer","hasSubCategories":false,"url":"/collection/for-the-exam-season-crammer-291934"},{"collectionId":301547,"title":"For the Game Day Prepper","hasSubCategories":false,"url":"/collection/big-game-day-prep-made-easy-301547"}],"navigationCollectionsLoadedStatus":"success","navigationCategories":{"books":{"0":{"data":[{"categoryId":33512,"title":"Technology","hasSubCategories":true,"url":"/category/books/technology-33512"},{"categoryId":33662,"title":"Academics & The Arts","hasSubCategories":true,"url":"/category/books/academics-the-arts-33662"},{"categoryId":33809,"title":"Home, Auto, & Hobbies","hasSubCategories":true,"url":"/category/books/home-auto-hobbies-33809"},{"categoryId":34038,"title":"Body, Mind, & Spirit","hasSubCategories":true,"url":"/category/books/body-mind-spirit-34038"},{"categoryId":34224,"title":"Business, Careers, & Money","hasSubCategories":true,"url":"/category/books/business-careers-money-34224"}],"breadcrumbs":[],"categoryTitle":"Level 0 Category","mainCategoryUrl":"/category/books/level-0-category-0"}},"articles":{"0":{"data":[{"categoryId":33512,"title":"Technology","hasSubCategories":true,"url":"/category/articles/technology-33512"},{"categoryId":33662,"title":"Academics & The Arts","hasSubCategories":true,"url":"/category/articles/academics-the-arts-33662"},{"categoryId":33809,"title":"Home, Auto, & Hobbies","hasSubCategories":true,"url":"/category/articles/home-auto-hobbies-33809"},{"categoryId":34038,"title":"Body, Mind, & Spirit","hasSubCategories":true,"url":"/category/articles/body-mind-spirit-34038"},{"categoryId":34224,"title":"Business, Careers, & Money","hasSubCategories":true,"url":"/category/articles/business-careers-money-34224"}],"breadcrumbs":[],"categoryTitle":"Level 0 Category","mainCategoryUrl":"/category/articles/level-0-category-0"}}},"navigationCategoriesLoadedStatus":"success"},"searchState":{"searchList":[],"searchStatus":"initial","relatedArticlesList":[],"relatedArticlesStatus":"initial"},"routeState":{"name":"ArticleCategory","path":"/category/articles/big-data-33578/","hash":"","query":{},"params":{"category":"big-data-33578"},"fullPath":"/category/articles/big-data-33578/","meta":{"routeType":"category","breadcrumbInfo":{"suffix":"Articles","baseRoute":"/category/articles"},"prerenderWithAsyncData":true},"from":{"name":null,"path":"/","hash":"","query":{},"params":{},"fullPath":"/","meta":{}}},"profileState":{"auth":{},"userOptions":{},"status":"success"}}
Logo
  • Articles Open Article Categories
  • Books Open Book Categories
  • Collections Open Collections list
  • Custom Solutions

Article Categories

Book Categories

Collections

Explore all collections
BYOB (Be Your Own Boss)
Be a Rad Dad
Career Shifting
Contemplating the Cosmos
For Those Seeking Peace of Mind
For the Aspiring Aficionado
For the Budding Cannabis Enthusiast
For the College Bound
For the Exam-Season Crammer
For the Game Day Prepper
Log In
  • Home
  • Technology Articles
  • Information Technology Articles
  • Data Science Articles
  • Big Data Articles

Big Data Articles

What's the biggest dataset you can imagine? Well, multiply that by a thousand and you're probably still not close to the mammoth piles of info that big data pros process. Learn all about it here.

Articles From Big Data

page 1
page 2
page 3
page 4
page 5
page 6
page 7
page 8
page 9
page 10
page 11
page 12
page 13
page 14
page 15
page 16
page 17
page 18

Filter Results

174 results
174 results
Big Data Beyond Boundaries: Unstructured Data Orchestration

Article / Updated 12-01-2023

Getting the most out of your unstructured data is an essential task for any organization these days, especially when considering the disparate storage systems, applications, and user locations. So, it’s not an accident that data orchestration is the term that brings everything together. Bringing all your data together shares similarities with conducting an orchestra. Instead of combining the violin, oboe, and cello, this brand of orchestration combines distributed data types from different places, platforms, and locations working as a cohesive entity presented to applications or users anywhere. That’s because historically, accessing high-performance data outside of your computer network was inefficient. Because the storage infrastructure existed in a silo, systems like HPC Parallel (which lets users store and access shared data across multiple networked storage nodes), Enterprise NAS (which allows large-scale storage and access to other networks), and Global Namespace (virtually simplifies network file systems) were limited when it came to sharing. Because each operated independently, the data within each system was siloed making it a problem collaborating with data sets over multiple locations. Collaboration was possible, but too often you lost the ability to have high performance. This Boolean logic decreased potential because having an IT architecture that supported both high performance and collaboration with data sets from different storage silos typically became an either/or decision: You were forced to choose one but never both. What is data orchestration? Data orchestration is the automated process of taking siloed data from multiple data storage systems and locations, combining and organizing it into a single namespace. Then a high-performance file system can place data in the edge service, data center, or cloud service most optimal for the workload. The recent rise of data analytic applications and artificial intelligence (AI) capabilities has accelerated the use of data across different locations and even different organizations. In the next data cycle, organizations will need both high-performance and agility with their data to compete and thrive in a competitive environment. That means data no longer has a 1:1 relationship with the applications and compute environment that generated it. It needs to be used, analyzed, and repurposed with different AI models and alternate workloads, and across a remote, collaborative environment. Hammerspace’s technology makes data available to different foundational models, remote applications, decentralized compute clusters, and remote workers to automate and streamline data-driven development programs, data insights, and business decision making. This capability enables a unified, fast, and efficient global data environment for the entire workflow — from data creation to processing, collaboration, and archiving across edge devices, data centers, and public and private clouds. Control of enterprise data services for governance, security, data protection, and compliance can now be implemented globally at a file-granular level across all storage types and locations. Applications and AI models can access data stored in remote locations while using automated orchestration tools to provide high-performance local access when needed for processing. Organizations can grow their talent pools with access to team members no matter where they reside. Decentralizing the data center Data collection has become more prominent, and the traditional system of centralized data management has limitations. Issues of centralized data storage can limit the amount of data available to applications. Then, there are the high infrastructure costs when multiple applications are needed to manage and move data, multiple copies of data are retained in different storage systems, and more headcount is needed to manage the complex, disconnected infrastructure environment. Such setbacks suggest that the data center is no longer the center of data and storage system constraints should no longer define data architectures. Hammerspace specializes in decentralized environments, where data may need to span two or more sites and possibly one or more cloud providers and regions, and/or where a remote workforce needs to collaborate in real time. It enables a global data environment by providing a unified, parallel global file system. Enabling a global data environment Hammerspace completely revolutionizes previously held notions of how unstructured data architectures should be designed, delivering the performance needed across distributed environments to Free workloads from data silos. Eliminate copy proliferation. Provide direct data access through local metadata to applications and users, no matter where the data is stored. This technology allows organizations to take full advantage of the performance capabilities of any server, storage system, and network anywhere in the world. This capability enables a unified, fast, and efficient global data environment for the entire workflow, from data creation to processing, collaboration, and archiving across edge devices, data centers, and public and private clouds. The days of enterprises struggling with a siloed, distributed, and inefficient data environment are over. It’s time to start expecting more from data architectures with automated data orchestration. Find out how by downloading Unstructured Data Orchestration For Dummies, Hammerspace Special Edition, here.

View Article
Big Data Big Data for Small Business For Dummies Cheat Sheet

Cheat Sheet / Updated 04-12-2022

Big data makes big headlines, but it’s much more than just a buzz phrase or the latest business fad. The phenomenon is very real and it’s producing concrete benefits in so many different areas – particularly in business. Here you will get to the heart of big data as a business owner or manager: You will take a look at the key terminology you need to understand the crucial big data skills for businesses, ten steps to using big data to make better decisions, and tips for communicating insights from data to your colleagues.

View Cheat Sheet
Big Data Statistics for Big Data For Dummies Cheat Sheet

Cheat Sheet / Updated 03-10-2022

Summary statistical measures represent the key properties of a sample or population as a single numerical value. This has the advantage of providing important information in a very compact form. It also simplifies comparing multiple samples or populations. Summary statistical measures can be divided into three types: measures of central tendency, measures of central dispersion, and measures of association.

View Cheat Sheet
Big Data Big Data For Dummies Cheat Sheet

Cheat Sheet / Updated 02-09-2022

To stay competitive today, companies must find practical ways to deal with big data — that is, to learn new ways to capture and analyze growing amounts of information about customers, products, and services. Data is becoming increasingly complex in structured and unstructured ways. New sources of data come from machines, such as sensors; social business sites; and website interaction, such as click-stream data. Meeting these changing business requirements demands that the right information be available at the right time.

View Cheat Sheet
Big Data Integrate Big Data with the Traditional Data Warehouse

Article / Updated 03-26-2016

While the worlds of big data and the traditional data warehouse will intersect, they are unlikely to merge anytime soon. Think of a data warehouse as a system of record for business intelligence, much like a customer relationship management (CRM) or accounting system. These systems are highly structured and optimized for specific purposes. In addition, these systems of record tend to be highly centralized. The diagram shows a typical approach to data flows with warehouses and marts: Organizations will inevitably continue to use data warehouses to manage the type of structured and operational data that characterizes systems of record. These data warehouses will still provide business analysts with the ability to analyze key data, trends, and so on. However, the advent of big data is both challenging the role of the data warehouse and providing a complementary approach. Think of the relationship between the data warehouse and big data as merging to become a hybrid structure. In this hybrid model, the highly structured optimized operational data remains in the tightly controlled data warehouse, while the data that is highly distributed and subject to change in real time is controlled by a Hadoop-based (or similar NoSQL) infrastructure. It's inevitable that operational and structured data will have to interact in the world of big data, where the information sources have not (necessarily) been cleansed or profiled. Increasingly, organizations are understanding that they have a business requirement to be able to combine traditional data warehouses with their historical business data sources with less structured and vetted big data sources. A hybrid approach supporting traditional and big data sources can help to accomplish these business goals.

View Article
Big Data Best Practices for Big Data Integration

Article / Updated 03-26-2016

Many companies are exploring big data problems and coming up with some innovative solutions. Now is the time to pay attention to some best practices, or basic principles, that will serve you well as you begin your big data journey. In reality, big data integration fits into the overall process of integration of data across your company. Therefore, you can't simply toss aside everything you have learned from data integration of traditional data sources. The same rules apply whether you are thinking about traditional data management or big data management. Keep these key issues at the top of your priority list for big data integration: Keep data quality in perspective. Your emphasis on data quality depends on the stage of your big data analysis. Don't expect to be able to control data quality when you do your initial analysis on huge volumes of data. However, when you narrow down your big data to identify a subset that is most meaningful to your organization, this is when you need to focus on data quality. Ultimately, data quality becomes important if you want your results to be understood n context with your historical data. As your company relies more and more on analytics as a key planning tool, data quality can mean the difference between success and failure. Consider real-time data requirements. Big data will bring streaming data to the forefront. Therefore, you will have to have a clear understanding of how you integrate data in motion into your environment for predictable analysis. Don't create new silos of information. While so much of the emphasis around big data is focused on Hadoop and other unstructured and semi-structured sources, remember that you have to manage this data in context with the business. You will therefore need to integrate these sources with your line of business data and your data warehouse.

View Article
Big Data Big Data Planning Stages

Article / Updated 03-26-2016

Four stages are part of the planning process that applies to big data. As more businesses begin to use the cloud as a way to deploy new and innovative services to customers, the role of data analysis will explode. Therefore, consider another part of your planning process and add three more stages to your data cycle. Stage 1: Planning with data: The only way to make sure that business leaders are taking a balanced perspective on all the elements of the business is to have a clear understanding of how data sources are related. The business needs a road map for determining what data is needed to plan for new strategies and new directions. Stage 2: Doing the analysis: Executing on big data analysis requires learning a set of new tools and new skills. Many organizations will need to hire some big data scientists who can understand how to take this massive amount of disparate data and begin to understand how all the data elements relate in the context of the business problem or opportunity. Stage 3: Checking the results: Make sure you aren’t relying on data sources that will take you in the wrong direction. Many companies will use third-party data sources and may not take the time to vet the quality of the data, but you have to make sure that you are on a strong foundation. Stage 4: Acting on the plan: Each time a business initiates a new strategy, it is critical to constantly create a big data business evaluation cycle. This approach of acting based on results of big data analytics and then testing the results of executing business strategy is the key to success. Stage 5: Monitoring in real time: Big data analytics enables you to monitor data in near real time proactively. This can have a profound impact on your business. If you are a pharmaceutical company conducting a clinical trial, you may be able to adjust or cancel a trial to avoid a lawsuit. Stage 6: Adjusting the impact: When your company has the tools to monitor continuously, it is possible to adjust processes and strategy based on data analytics. Being able to monitor quickly means that a process can be changed earlier and result in better overall quality. Stage 7: Enabling experimentation: Combining experimentation with real-time monitoring and rapid adjustment can transform a business strategy. You have less risk with experimentation because you can change directions and outcomes more easily if you are armed with the right data. The greatest challenge for the business is to be able to look into the future and anticipate what might change and why. Companies want to be able to make informed decisions in a faster and more efficient manner. The business wants to apply that knowledge to take action that can change business outcomes. Leaders also need to understand the nuances of the business impacts that are across product lines and their partner ecosystem. The best businesses take a holistic approach to data.

View Article
Big Data How to Analyze Big Data to Get Results

Article / Updated 03-26-2016

Big data is most useful if you can do something with it, but how do you analyze it? Companies like Amazon and Google are masters at analyzing big data. And they use the resulting knowledge to gain a competitive advantage. Just think about Amazon's recommendation engine. The company takes all your buying history together with what it knows about you, your buying patterns, and the buying patterns of people like you to come up with some pretty good suggestions. It's a marketing machine, and its big data analytics capabilities have made it extremely successful. The ability to analyze big data provides unique opportunities for your organization as well. You'll be able to expand the kind of analysis you can do. Instead of being limited to sampling large data sets, you can now use much more detailed and complete data to do your analysis. However, analyzing big data can also be challenging. Changing algorithms and technology, even for basic data analysis, often has to be addressed with big data. The first question that you need to ask yourself before you dive into big data analysis is what problem are you trying to solve? You may not even be sure of what you are looking for. You know you have lots of data that you think you can get valuable insight from. And certainly, patterns can emerge from that data before you understand why they are there. If you think about it though, you're sure to have an idea of what you're interested in. For instance, are you interested in predicting customer behavior to prevent churn? Do you want to analyze the driving patterns of your customers for insurance premium purposes? Are you interested in looking at your system log data to ultimately predict when problems might occur? The kind of high-level problem is going to drive the analytics you decide to use. Alternately, if you're not exactly sure of the business problem you're trying to solve, maybe you need to look at areas in your business that need improvement. Even an analytics-driven strategy — targeted at the right area — can provide useful results with big data. When it comes to analytics, you might consider a range of possible kinds, which are briefly outlined in the table. Analysis Type Description Basic analytics for insight Slicing and dicing of data, reporting, simple visualizations, basic monitoring. Advanced analytics for insight More complex analysis such as predictive modeling and other pattern-matching techniques. Operationalized analytics Analytics become part of the business process. Monetized analytics Analytics are utilized to directly drive revenue.

View Article
Big Data Ten Hot Big Data Trends

Article / Updated 03-26-2016

As you enter the world of big data, you'll need to absorb many new types of database and data-management technologies. Here are the top-ten big data trends: Hadoop is becoming the underpinning for distributed big data management. Hadoop is a distributed file system that can be used in conjunction with MapReduce to process and analyze massive amounts of data, enabling the big data trend. Hadoop will be tightly integrated into data warehousing technologies so that structured and unstructured data can be integrated more effectively. Big data makes it possible to leverage data from sensors to change business outcomes. More and more businesses are using highly sophisticated sensors on the equipment that runs their operations. New innovations in big data technology are making it possible to analyze all this data to get advanced notification of problems that can be fixed to protect the business. Big data can help a business initiative become a real-time action to increase revenue. Companies in markets such as retail are using real-time streaming data analytics to keep track of customer actions and offer incentives to increase revenue per customer. Big data can be integrated with historical data warehouses to transform planning. Big data can provide a company with a better understanding of massive amounts of data about their business. This information about the current state of the business can be combined with historical data to get a full view of the context for business change. Big data can change the way diseases are managed by adding predictive analytics. Increasingly, healthcare practitioners are looking to big data solutions to gain insights into disease by compare symptoms and test results to databases of results from hundreds of thousands of other cases. This allows practitioners to more quickly predict outcomes and save lives. Cloud computing will transform the way that data will be managed in the future. Cloud computing is invaluable as a tool to support the expansion of big data. Increasingly, cloud services that are optimized for data will mean that many more services and delivery models will make big data more practical for companies of all sizes. Security and governance will be the difference between success and failure of businesses leveraging big data. Big data can be a huge benefit, but it isn't risk-free. Companies will discover that if they are not careful, it is possible to expose private information through big data analysis. Companies need to balance the need to analyze results with best practices for security and governance. Veracity, or truthfulness, of big data will become the most important issue for the coming year. Many companies can get carried away with the ability to analyze massive amounts of data and get back compelling results that predict business outcomes. Therefore, companies will find that the truthfulness of the data must become a top priority or decision making will suffer. As big data moves out of the experimental stage, more packaged offerings will be developed. Most big data projects initiated over the past few years have been experimental. Companies are cautiously working with new tools and technology. Now big data is about to enter the mainstream. Lots of packaged big data offerings will flood the market. Use cases and new innovative ways to apply big data will explode. Early successes with big data in different industries such as manufacturing, retail, and healthcare will lead to many more industries looking at ways to leverage massive amounts of data to transform their industries.

View Article
Big Data Explore the Big Data Stack

Article / Updated 03-26-2016

To understand big data, it helps to see how it stacks up — that is, to lay out the components of the architecture. A big data management architecture must include a variety of services that enable companies to make use of myriad data sources in a fast and effective manner. Here's a closer look at what's in the image and the relationship between the components: Interfaces and feeds: On either side of the diagram are indications of interfaces and feeds into and out of both internally managed data and data feeds from external sources. To understand how big data works in the real world, start by understanding this necessity. What makes big data big is that it relies on picking up lots of data from lots of sources. Therefore, open application programming interfaces (APIs) will be core to any big data architecture. In addition, keep in mind that interfaces exist at every level and between every layer of the stack. Without integration services, big data can't happen. Redundant physical infrastructure: The supporting physical infrastructure is fundamental to the operation and scalability of a big data architecture. Without the availability of robust physical infrastructures, big data would probably not have emerged as such an important trend. To support an unanticipated or unpredictable volume of data, a physical infrastructure for big data has to be different than that for traditional data. The physical infrastructure is based on a distributed computing model. This means that data may be physically stored in many different locations and can be linked together through networks, the use of a distributed file system, and various big data analytic tools and applications. Security infrastructure: The more important big data analysis becomes to companies, the more important it will be to secure that data. For example, if you are a healthcare company, you will probably want to use big data applications to determine changes in demographics or shifts in patient needs. This data about your constituents needs to be protected both to meet compliance requirements and to protect the patients' privacy. You will need to take into account who is allowed to see the data and under what circumstances they are allowed to do so. You will need to be able to verify the identity of users as well as protect the identity of patients. Operational data sources: When you think about big data, understand that you have to incorporate all the data sources that will give you a complete picture of your business and see how the data impacts the way you operate your business. Traditionally, an operational data source consisted of highly structured data managed by the line of business in a relational database. But as the world changes, it is important to understand that operational data now has to encompass a broader set of data sources.

View Article
page 1
page 2
page 3
page 4
page 5
page 6
page 7
page 8
page 9
page 10
page 11
page 12
page 13
page 14
page 15
page 16
page 17
page 18

Quick Links

  • About For Dummies
  • Contact Us
  • Activate Online Content

Connect

About Dummies

Dummies has always stood for taking on complex concepts and making them easy to understand. Dummies helps everyone be more knowledgeable and confident in applying what they know. Whether it's to pass that big test, qualify for that big promotion or even master that cooking technique; people who rely on dummies, rely on it to learn the critical skills and relevant information necessary for success.

Copyright @ 2000-2024 by John Wiley & Sons, Inc., or related companies. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.

Terms of Use
Privacy Policy
Cookies Settings
Do Not Sell My Personal Info - CA Only