Unwelcome…
… are platform instability, downtime, hardware failure, poor performance, cluster resource contention, repeated process failures, runaway live queries, critical services alarms, invisibility into alarm cacophony… the list goes on. If these are ailments you wish to cure …
Welcome!
To this six-part series, where we’ll look at how to get control of the health of your Cloudera Data Platform (CDP) environment. Out of the box, CDP performs beautifully, but over time, if data architecture, data engineering, and DevOps best practices are not maintained, the Data City you’ve erected atop a solid CDP bedrock can become the wild, wild west. Perhaps it’s time for some law and order to prevent further crimes against the tech.
More than a case study, we’ve interwoven best practices gleaned from multiple configurations and client sites into a comprehensive, easy-to-understand set of instructions to diagnose and resolve many of the issues that adversely impact CDP environmental health.
With each blog we’ll outline the symptoms and root causes of common environmental health challenges and prescribe solutions. Where we can, we’ll include helpful links to step-by-step instructions to guide you through successful implementation. When we conclude the series, we’ll share a homegrown tool, an environmental health scorecard, to monitor and manage the health of your environment.
There are many, many reasons that an environment may perform poorly, and certainly some resolutions take time and effort, but there is quite succulent low-hanging fruit. Our great hope is that you find impactful quick wins that encourage you to pursue multiple avenues of health improvement. You may also decide to partner with our Cloudera Professional Services team, who more than doubled a customer’s health score in two short quarters.
Categories of CDP Environmental Health
We’ve categorized aspects of environmental health for this series.
Visibility and Transparency
Into the cluster, platform, services, and processes. We won’t be able to make much progress if we do not have proper visibility into the problems. That’s observability. In this blog we provide instructions and tools on how to gain visibility, suppress alarm noise, find and analyze the root causes of the most significant opportunities, and proactively notify your users when incidents occur.
Data Asset Standardization
Of common datasets, pipelines, processes, and reports. Admittedly, data asset standardization is a multiyear journey; however, addressing only your most problematic and resource-intensive processes and assets may yield more environmental health improvement than any other category. We’ll share best practices on how to locate and capitalize on these opportunities.
Platform Health
Includes hardware and services settings and configurations. Cloudera Data Platform (CDP) must be configured properly to function efficiently with high performance. Additionally, as business needs continually change, so will your use of the platform, and that may necessitate re-tuning. To help you on that journey, we’ll list some common symptoms, link them to root cause analysis steps, provide proper configuration guidelines, and outline the steps to properly tune your environment.
The Right Tool for the Job
Includes the proper use of Impala, CDSW, Airflow, NiFi, and CM. You might be surprised at the adverse environmental impact of using CDSW as an ETL pipeline tool or using Impala to write unwieldy queries with an embarrassing number of joins. We’ve done it too. We confess. We’ll highlight the advantages of using Airflow to manage complex data pipelines with its facility to divide workflow into small independent tasks. We’ll list other do’s and don’ts.
Environmental Health Scoring
Brings it all together by demonstrating how to measure, score, monitor, and control environmental health via dashboards that we provide for you, along with instructions to hook them up to your logs.
If you’ve got the symptoms, the doctors are in. Let the healing begin!