Some discussions around data architecture and modern data platforms are rather oversimplified. For example, if you asked someone to explain the difference between ETL and ELT, you might get answers like the following:
“ETL is old, ELT is new”
“ETL is bad, ELT is good”
“ETL is databases, ELT is data lakes”
Let’s actually understand the terms first with the help of the AWS blog, which has a lot of great content.
Firstly, AWS points out that the E (Extract) is the same in both cases – happy days, we can move on.

Now, using both the diagrams above and the table below, we can compare the T (Transform) and the L (Load). The short version: in ETL, the data is transformed before it reaches the target, typically in the tool or on a separate processing server; in ELT, the raw data is loaded into the target first and transformed there.
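To make that concrete, here is a tiny sketch of the two patterns (the table names and data are made up, and the SQL is mine rather than anything from the AWS post): the steps are the same, but they happen in a different order and, more importantly, in a different place.

```sql
-- Two hypothetical tables, just to make the contrast concrete.
CREATE TABLE dbo.StagingCustomer (CustomerID int, CustomerName varchar(100));
CREATE TABLE dbo.DimCustomer     (CustomerID int, CustomerName varchar(100));

-- ETL: the transform happens in the tool, before the load, so the
-- warehouse only ever receives finished rows:
INSERT INTO dbo.DimCustomer (CustomerID, CustomerName)
VALUES (1, 'ACME LTD');                -- already cleaned mid-flight

-- ELT: load the raw row "as is" first...
INSERT INTO dbo.StagingCustomer (CustomerID, CustomerName)
VALUES (2, '  globex corp ');
-- ...then transform it inside the warehouse, in SQL:
INSERT INTO dbo.DimCustomer (CustomerID, CustomerName)
SELECT CustomerID, UPPER(LTRIM(RTRIM(CustomerName)))
FROM dbo.StagingCustomer;
```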

Once I had taken this information in, I started to think back to all the SQL Server-based Data Warehouses I have worked on over the last 15 years or so.
I realised those architectures followed the second image: SSIS would copy (Extract & Load) the source system data “as is” into a staging database on the Data Warehouse server, and we would then use Stored Procedures to transform that data into dimensions and facts.
It is entirely possible that some people used SSIS (or other ETL tools) to transform the data mid-flight and load it directly into their star schema, but that was rare, as Stored Procedures were often simpler, faster and more elegant.
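To show what I mean, here is a minimal sketch of that kind of Stored Procedure, reusing the hypothetical tables from the earlier snippet (the names are still made up, but the shape is the point): the E and L have already happened, and the T is plain set-based SQL running inside the warehouse.

```sql
-- Hypothetical, but this is the shape of the "T" in our ELT:
-- raw staged rows in, conformed dimension rows out, all set-based.
CREATE PROCEDURE dbo.LoadDimCustomer
AS
BEGIN
    SET NOCOUNT ON;

    MERGE dbo.DimCustomer AS tgt
    USING (
        -- The cleansing and conforming happens here, in the SELECT
        SELECT CustomerID,
               UPPER(LTRIM(RTRIM(CustomerName))) AS CustomerName
        FROM dbo.StagingCustomer
    ) AS src
        ON tgt.CustomerID = src.CustomerID
    WHEN MATCHED THEN
        UPDATE SET tgt.CustomerName = src.CustomerName
    WHEN NOT MATCHED THEN
        INSERT (CustomerID, CustomerName)
        VALUES (src.CustomerID, src.CustomerName);
END;
```

Run one of these per dimension or fact after each staging load and you have, in all but name, the “EL then T” split the second diagram describes.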
So although ELT in the world of Fivetran, semi-structured data and Spark may look quite different to what we were doing “back in the day”, if you go back to the “dictionary definitions” of the terms, you will see that it is often not a fundamental departure from where we were.