Fern Halper

Dr. Fern Halper specializes in big data and analytics.

Articles & Books From Fern Halper

Cheat Sheet / Updated 02-09-2022
To stay competitive today, companies must find practical ways to deal with big data — that is, to learn new ways to capture and analyze growing amounts of information about customers, products, and services.Data is becoming increasingly complex in structured and unstructured ways. New sources of data come from machines, such as sensors; social business sites; and website interaction, such as click-stream data.
Article / Updated 03-26-2016
Big data is most useful if you can do something with it, but how do you analyze it? Companies like Amazon and Google are masters at analyzing big data. And they use the resulting knowledge to gain a competitive advantage. Just think about Amazon's recommendation engine. The company takes all your buying history together with what it knows about you, your buying patterns, and the buying patterns of people like you to come up with some pretty good suggestions.
Article / Updated 03-26-2016
The best way to understand the economics of big data is to look at the various methods for putting big data to work for your organization. While specific costs may vary due to the size of your organization, its purchasing power, vendor relationships, and so on, the classes of expense are fairly consistent. Big data types and sources The most important decisions you need to make with respect to types and sources are What data will be necessary to address your business problem?
Article / Updated 03-26-2016
The fundamental structure for graph databases in big data is called “node-relationship.” This structure is most useful when you must deal with highly interconnected data. Nodes and relationships support properties, a key-value pair where the data is stored. These databases are navigated by following the relationships.
Article / Updated 03-26-2016
To fully understand the capabilities of Hadoop MapReduce, it’s important to differentiate between MapReduce (the algorithm) and an implementation of MapReduce. Hadoop MapReduce is an implementation of the algorithm developed and maintained by the Apache Hadoop project. It is helpful to think about this implementation as a MapReduce engine, because that is exactly how it works.
Article / Updated 03-26-2016
Clearly, the very nature of the cloud makes it an ideal computing environment for big data. So how might you use big data together with the cloud? Here are some examples: IaaS in a public cloud: In this scenario, you would be using a public cloud provider’s infrastructure for your big data services because you don’t want to use your own physical infrastructure.
Article / Updated 03-26-2016
Cloud computing is a method of providing a set of shared computing resources and is becoming increasingly important for your big data initiative. The cloud includes applications, computing, storage, networking, development, and deployment platforms, as well as business processes. Cloud computing turns traditional siloed computing assets into shared pools of resources based on an underlying Internet foundation.
Article / Updated 03-26-2016
When people talk of map and reduce in big data, they do so as operations within a functional programming model. Functional programming is one of the two ways that software developers create programs to address business problems. The other model is procedural programming. Take a quick look to understand the differences and to see when it's best to use one or the other model.
Article / Updated 03-26-2016
Cloud computing has evolved in recent years. The new world of the hybrid cloud is an environment that employs both private and public cloud services. Companies are realizing that they need many different types of cloud services in order to meet a variety of customer needs. The growing importance of hybrid cloud environments is transforming the entire computing industry as well as the way businesses are able to leverage technology to innovate.
Article / Updated 03-26-2016
Business Process as a Service (BPaaS) is any type of horizontal or vertical business process that’s delivered based on the cloud services model. These cloud services — which include Software as a Service (SaaS), Platform as a Service (PaaS), and Infrastructure as a Service (IaaS) — are therefore dependent on related services.