Can algorithms reinforce our biases?
Arun Krishnan

A recent Harvard Business Review (HBR) interview on the misuse of algorithms got me thinking about whether the algorithms we use can reinforce our biases. Before I explain why I feel that way, let me describe how machine learning algorithms classify datapoints into multiple classes. To do so, I will first step back and explain what classification means.

Classification is the task of assigning a collection of objects to separate, predefined groups. Say an organization wants to build a prediction engine that can separate excellent performers from average and unacceptable ones. This is a classification task, since the algorithm is required to go through the list of employees and put each of them into one of three buckets, viz., excellent, average and unacceptable. To do this, data scientists would use a machine learning approach. Simply put, machine learning algorithms look for patterns within datasets and learn the patterns corresponding to each of the buckets into which they need to classify items.
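
To make this concrete, here is a minimal sketch in Python using scikit-learn. The two features (years of experience and an appraisal score) and all the numbers are made up purely for illustration; a real performance model would use far richer data.

```python
from sklearn.tree import DecisionTreeClassifier

# Each row is one (hypothetical) employee: [years_of_experience, appraisal_score]
X = [
    [8, 4.7], [6, 4.5], [7, 4.8],   # excellent performers
    [4, 3.2], [5, 3.5], [3, 3.1],   # average performers
    [2, 1.8], [1, 2.0], [2, 1.5],   # unacceptable performers
]
y = ["excellent"] * 3 + ["average"] * 3 + ["unacceptable"] * 3

model = DecisionTreeClassifier(random_state=0).fit(X, y)  # learn a pattern for each bucket
print(model.predict([[7, 4.6]]))                          # -> ['excellent']
```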

Hence, using our example, the algorithm would learn the patterns for excellent, average and unacceptable performers. To do this, these algorithms require a lot of historical data. Essentially, we need to 'show' the algorithm examples of excellent, average and unacceptable performers. This process is known as 'training' the algorithm. It is similar to how we teach kids the alphabet: we show them the letters again and again and correct them when they get it wrong, so that ultimately they are able to recognize the characters. This is exactly what happens when we train our algorithm. Training allows the algorithm to identify 'signatures' for each of these buckets. When it is then presented with a new datapoint, it compares the new data against the learnt patterns, and if there is a lot of similarity between the two, it assigns the new datapoint to the corresponding bucket.
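
One simple way to picture these 'signatures' is as the average profile of each bucket: a new datapoint is assigned to whichever signature it most resembles. The toy sketch below illustrates that intuition with made-up numbers; it is not how every algorithm works, just a way to build the mental model.

```python
import numpy as np

# Learnt "signatures": here, simply the average profile of each bucket
signatures = {
    "excellent":    np.array([7.0, 4.7]),
    "average":      np.array([4.0, 3.3]),
    "unacceptable": np.array([1.7, 1.8]),
}

def classify(point):
    # Assign a new datapoint to the bucket whose signature it is closest to
    point = np.array(point)
    return min(signatures, key=lambda bucket: np.linalg.norm(point - signatures[bucket]))

print(classify([6.5, 4.4]))   # -> excellent
```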

So what can go wrong, you ask? To train the algorithm, we need to provide it with a lot of examples to learn from. Let us assume that we are looking at ABC Inc, a traditional and conservative business based out of Bangalore. Most of the people employed at ABC come from the southern part of the country. Moreover, given the conservative attitudes of their managers, women are under-represented in the organization in general and at higher levels in particular. ABC now wants to build a model of the best people to hire, and for this, they have made available their historical employee database. This is where the problem lies. All their biases are part of their historical dataset, so any model trained on this dataset will also inherit these biases. As an extreme case, if we now have a woman applicant from Delhi, she might well get filtered out by the algorithm, since she doesn't fit the typical profile the model has been trained on. While this is a gross simplification of the issue, it does illustrate the pitfalls of building models on historical data without taking into account the biases inherent in that data.
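
To see the mechanism at work, here is a deliberately exaggerated, hypothetical example. In the training data below, every past hire is a man from the south, so a simple classifier learns gender and region as if they were predictive of being a good hire and filters out an equally qualified woman applicant from Delhi. The features and numbers are invented purely for illustration.

```python
from sklearn.linear_model import LogisticRegression

# Features: [is_woman, is_from_south, qualification_score] -- all hypothetical
X_train = [
    [0, 1, 0.90], [0, 1, 0.70], [0, 1, 0.80], [0, 1, 0.85],   # past hires: all men from the south
    [1, 0, 0.90], [1, 1, 0.80], [0, 0, 0.85], [1, 0, 0.75],   # equally qualified, but not hired
]
y_train = [1, 1, 1, 1, 0, 0, 0, 0]   # 1 = hired, 0 = not hired

model = LogisticRegression().fit(X_train, y_train)

# A highly qualified woman applicant from Delhi
applicant = [[1, 0, 0.95]]
print(model.predict(applicant))   # -> [0]: filtered out, despite her qualifications
```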

Given that most organizations will have some biases, be they about gender, age or educational institutions, how can they build predictive models if these biases are implicitly baked into any model built on their data? There are no easy answers here. One way is to utilize publicly available datasets, or to pool with other organizations in the space to obtain datasets that normalize the biases. Whatever the approach, any data scientist worth her salt ought to be aware of these built-in biases and look for them before undertaking any model building.
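
One simple place to start looking is the training data itself, before any model is built: for instance, comparing historical selection rates across groups. The check below is only a sketch, with hypothetical column names and numbers; real bias audits go much further.

```python
import pandas as pd

# Hypothetical slice of ABC's historical hiring data
history = pd.DataFrame({
    "gender": ["M", "M", "M", "F", "M", "F", "M", "F"],
    "region": ["South", "South", "South", "North", "South", "South", "North", "North"],
    "hired":  [1, 1, 1, 0, 1, 0, 0, 0],
})

# Historical selection rate per group; large gaps flag biases a model would inherit
print(history.groupby("gender")["hired"].mean())
print(history.groupby("region")["hired"].mean())
```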
