How to properly train artificial intelligence.

When programmed correctly, artificial intelligence can take a great deal of work off people's hands. But things go wrong when it has been trained on the wrong data. So how does AI learn to do the right thing?

Aug 20, 2019

If you train a self-learning algorithm using the wrong data, there’s a risk of mishaps.
“Only clean data will prevent machines from making the wrong decisions.”
Measurable successes at EOS: incoming payments rise by eleven percent thanks to the use of AI.

The “hotness” filter in the photo-editing app FaceApp shows what happens when a self-learning algorithm is trained on the wrong data. Two years ago, the filter lightened photos of dark-skinned people to make them look whiter. The reason for the change in skin color was that the AI had been trained on only one dataset, which contained light-skinned Caucasian faces. If the training had taken all ethnic groups into account, this mishap would not have occurred.

Disadvantaged by poor data.

Someone who knows how to train AI systems correctly is Andreas Dix from the Data Science Team at EOS in Germany. The data specialist trains machines for repetitive and time-consuming processes. “Only clean data prevents machines from making wrong decisions,” he says.

One way to avoid these mishaps is proper data exploration. That means approaching the dataset without hypotheses, i.e. impartially and without unconfirmed assumptions. The expert then tries to find out what kind of usable information the dataset contains. Are there variables in it that show no dispersion at all, i.e. the same value in every record? Or does it include variables with too many missing values? Such variables should be excluded because they can distort what the model learns. “We need to know exactly where the relationships are so that artificial intelligence works properly based on our training,” says Dix.

Intelligent programming.

Machine learning algorithms need clean data to recognize structures and draw conclusions. “The rules and conditions set up by the algorithms during training must not be too specific, because then they have no value at all for actually predicting something. This is called overfitting. It is better to generalize, i.e. to find less specific structures and, as a result, achieve good accuracy on newly acquired data as well.” This can be achieved, for example, by tuning the hyperparameters of the algorithm and by using more training data.
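A small experiment can make the difference between memorizing and generalizing visible. The sketch below is an assumption-laden toy, not the method used at EOS: it uses a simple k-nearest-neighbor classifier on synthetic, deliberately noisy data, treats the neighbor count `k` as the hyperparameter, and picks the value that scores best on data the model has not seen.

```python
import random


def make_data(n, rng):
    """Synthetic 1-D data: label is 1 when x > 0.5, with 20% of labels flipped."""
    data = []
    for _ in range(n):
        x = rng.random()
        label = int(x > 0.5)
        if rng.random() < 0.2:
            label = 1 - label  # inject label noise
        data.append((x, label))
    return data


def knn_predict(train, x, k):
    """Majority vote among the k training points closest to x."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:k]
    votes = sum(label for _, label in nearest)
    return int(votes * 2 > k)


def accuracy(train, data, k):
    """Share of points in `data` that the model built on `train` labels correctly."""
    hits = sum(knn_predict(train, x, k) == label for x, label in data)
    return hits / len(data)


rng = random.Random(0)           # fixed seed so the run is repeatable
train = make_data(200, rng)
validation = make_data(200, rng)  # held-out data, never used for fitting

candidates = [1, 5, 15, 45]      # odd values avoid tied votes
train_acc = {k: accuracy(train, train, k) for k in candidates}
val_acc = {k: accuracy(train, validation, k) for k in candidates}
best_k = max(candidates, key=lambda k: val_acc[k])

for k in candidates:
    print(f"k={k:2d}  train={train_acc[k]:.2f}  validation={val_acc[k]:.2f}")
print("selected k:", best_k)
```

With `k=1` every training point is its own nearest neighbor, so the model reproduces the training labels perfectly, noise included. That perfect score says nothing about new data, which is why the hyperparameter is selected on the validation set instead.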

Fully automated receivables.

For debt collection at EOS this means, for example, that AI can predict the best collection step to take next. Specifically, the data held in the system up to that point about the receivable itself and about the defaulting payer is collected, aggregated and prepared. Only then are all models queried with this data to predict how successful each collection activity will be for this particular receivable at this point in time. Or to put it more clearly: how much payment inflow EOS can expect. Finally, the activity rated as the best option after all criteria have been applied is executed by the debt collection system.
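The selection step described here can be illustrated with a short Python sketch. Everything in it is hypothetical: the activity names, the toy prediction functions standing in for trained models, the cost figures, and the choice of "predicted inflow minus activity cost" as the single ranking criterion. The article does not say which criteria the EOS system actually applies.

```python
from typing import Callable, Dict, Tuple

Receivable = Dict[str, float]
Model = Callable[[Receivable], float]  # returns predicted payment inflow


def choose_next_action(
    receivable: Receivable,
    models: Dict[str, Model],
    costs: Dict[str, float],
) -> Tuple[str, Dict[str, float]]:
    """Query one model per activity and return the best-rated activity.

    Each activity is scored as predicted inflow minus its cost
    (an assumed criterion for this sketch).
    """
    scores = {
        action: model(receivable) - costs[action]
        for action, model in models.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical stand-ins for trained models, invented for this example.
models = {
    "reminder_letter": lambda r: 0.10 * r["amount"],
    "phone_call": lambda r: 0.25 * r["amount"] if r["has_phone"] else 0.0,
    "legal_dunning": lambda r: 0.40 * r["amount"],
}
costs = {"reminder_letter": 1.0, "phone_call": 6.0, "legal_dunning": 40.0}

receivable = {"amount": 200.0, "has_phone": 1.0}
best_action, scores = choose_next_action(receivable, models, costs)
print(best_action, scores)
```

For this invented receivable the phone call wins, because its predicted inflow after costs is the highest of the three. Changing the prepared input data, for example setting `has_phone` to `0.0`, changes the ranking, which is the point of querying all models for each receivable at each point in time.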

Measurable successes.

Measurable successes have already been recorded thanks to the use of AI at EOS. “At EOS in Germany we have been making productive use of the data-driven AI system D3, Data Driven Decisions. We use it to control the collection process with the result that payment receipts are up around 10 percent. This means that we are achieving five percent higher earnings after activity costs are deducted compared with our previous receivables processing system,” says Dix.

Human intelligence is the most important of all.

When asked whether human beings could at some point become superfluous to these processes because machine learning programs take on a life of their own, the data specialist pauses briefly: “I think that in the end, systems with artificial intelligence are a useful complement for human input. But the human being who controls the process and takes important decisions is still the most important factor.” After all, it is the human being who has to feed the machine with the correct data.

Please contact us if you would like more information.

Press contact

Phone: +49 40 2850-1222

presse@eos-solutions.com

Photo credits: Achim Multhaupt
