Ever wondered how machine learning works? In this two-part article we explain machine learning with Excel. If you missed the first article and video, you can read it here.
Iterations
Machine learning algorithms iterate over the data again and again until they establish the best order in which to ask the questions. For example, the algorithm removes the data where the entropy is 0 and reruns the calculations to find the next question where an entropy of 0 is found. An entropy of 0 marks a final, or decision, leaf on a decision tree. Where the entropy is greater than 0, the item with the most information gain becomes the next question, and the iteration is repeated on an amended data set that depends on the answers.
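For readers who prefer code to spreadsheets, here is a minimal Python sketch of the two calculations that drive each iteration. It is not the workbook from this article: the function names and the four loan records are invented for illustration, and the column names (has_job, owns_home, defaulted) are assumptions.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return sum(-(n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, question, target):
    """Entropy of the whole set minus the weighted entropy after splitting on `question`."""
    before = entropy([row[target] for row in rows])
    after = 0.0
    for answer in {row[question] for row in rows}:
        subset = [row[target] for row in rows if row[question] == answer]
        after += len(subset) / len(rows) * entropy(subset)
    return before - after


# Hypothetical loan records, invented for illustration only.
rows = [
    {"has_job": "yes", "owns_home": "yes", "defaulted": "no"},
    {"has_job": "yes", "owns_home": "no", "defaulted": "no"},
    {"has_job": "no", "owns_home": "yes", "defaulted": "yes"},
    {"has_job": "no", "owns_home": "no", "defaulted": "yes"},
]

print(entropy([r["defaulted"] for r in rows]))           # 1.0, an even split is fully mixed
print(information_gain(rows, "has_job", "defaulted"))    # 1.0, every answer gives entropy 0
print(information_gain(rows, "owns_home", "defaulted"))  # 0.0, the answers tell us nothing
```

In this made-up data, has_job would be chosen as the first question because it has the highest information gain, and both of its answers lead straight to a decision leaf.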
Looking at our sample data, in the first iteration entropy is calculated for each classification and for the data set as a whole. The first question is the classification that gives the most information gain, or one where the entropy is 0.
After this, any data with an entropy of 0 is removed and the algorithm runs again. The data that is removed is the data on which a decision can already be made. This continues, running and branching, until the best decision tree route is defined.
Entropy and information gain can be used to create decision trees with a top-down approach. Below is the path chosen, up to the second iteration, using the sample training data and an entropy and information gain model. Watch this video, where we talk through the algorithm process and iterations to show how this was achieved.
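The repeat-and-branch process described above can be sketched in Python as a short recursive function. This is an illustration under stated assumptions, not the Excel model from the article: build_tree, split and gain are hypothetical helper names, and the five loan records are invented. When a group of rows reaches an entropy of 0 it becomes a leaf and takes no further part; otherwise the remaining question with the most information gain is asked next.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a non-empty list of class labels."""
    total = len(labels)
    return sum(-(n / total) * log2(n / total) for n in Counter(labels).values())


def split(rows, question):
    """Group rows by their answer to `question`."""
    groups = {}
    for row in rows:
        groups.setdefault(row[question], []).append(row)
    return groups


def gain(rows, question, target):
    """Information gain from asking `question` about `rows`."""
    before = entropy([r[target] for r in rows])
    after = sum(
        len(g) / len(rows) * entropy([r[target] for r in g])
        for g in split(rows, question).values()
    )
    return before - after


def build_tree(rows, questions, target):
    labels = [r[target] for r in rows]
    # Entropy of 0 means every remaining row agrees: this is a decision leaf.
    # If no questions are left, fall back to the most common outcome.
    if entropy(labels) == 0 or not questions:
        return Counter(labels).most_common(1)[0][0]
    best = max(questions, key=lambda q: gain(rows, q, target))
    remaining = [q for q in questions if q != best]
    # Re-run the same calculation on the amended data set for each answer.
    return {
        best: {
            answer: build_tree(group, remaining, target)
            for answer, group in split(rows, best).items()
        }
    }


# Hypothetical loan records, invented for illustration only.
rows = [
    {"has_job": "yes", "owns_home": "yes", "defaulted": "no"},
    {"has_job": "yes", "owns_home": "no", "defaulted": "no"},
    {"has_job": "yes", "owns_home": "no", "defaulted": "no"},
    {"has_job": "no", "owns_home": "yes", "defaulted": "no"},
    {"has_job": "no", "owns_home": "no", "defaulted": "yes"},
]

tree = build_tree(rows, ["has_job", "owns_home"], "defaulted")
print(tree)
# {'has_job': {'yes': 'no', 'no': {'owns_home': {'yes': 'no', 'no': 'yes'}}}}
```

Here the first iteration picks has_job; the "yes" branch has an entropy of 0 and becomes a leaf, while the "no" branch still has mixed outcomes, so a second iteration asks owns_home on just those rows.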
Conclusion
Banks and the finance sector use ML every day to decide whether a person is creditworthy. The data is ever growing, and improvements have been made as new data is added. What we have seen in this example is a bank with a list of 3 questions. Using these questions, the history of loan defaults, and probability, the best route for questioning in a decision tree can be established.
In real life, the questions asked by a bank are more complex and the training data set is a lot bigger, so this decision tree can quickly become a forest, with many iterations over the data to reach decision or leaf nodes.
Entropy and information gain calculations are only one way of building a decision tree. Most machine learning algorithms are built on probability calculations in some form. It is therefore important to have a good understanding of statistics and probability if you want to create machine learning algorithms. Remember that decision trees are only one form of machine learning algorithm; the aim of this article was to give you an understanding of how machine learning and its algorithms work.
Cross posted from my blog with SteemPress : https://theexcelclub.com/simple-explanation-of-machine-learning-shown-with-excel-part-2/
Thank you for the introduction to ML through decision trees, entropy, and information gain. I enjoyed the bank example.
you are most welcome, I am glad you liked it.
OK, I have to admit that I was disappointed there was no practice exercise. :-) I was so looking forward to working through a decision tree.
Do you have any other examples to recommend for working through a decision tree in Excel using the concepts you shared?
Hi @paulag ,
I want to ask you as an Excel expert.
What would be the easiest way (for noobs like me) to implement the SHA256 hashing function in an Excel worksheet?
I want to do some tests and learning with crypto coin algorithms.
Hashing is such a powerful function, but I cannot find a way to automate it. I got tired of copy-pasting my data into dedicated applications all the time.
Thanks.
PS
I am with the same nick name on Discord, Telegram, Slack, FB, Twitter
hi and thanks for the question.
For noobs it's a little difficult, as it would require VBA code to be written. There is a sample in this post that I found:
https://superuser.com/questions/550592/is-there-an-excel-function-to-create-a-hash-value
Thanks.
I think I've seen this, but had hopes there might be a simpler way... :)
I always remember a joke of my professor's, which goes:
There are two types of people in this world.
Those who understand iterations and those who understand iterations.
Well, let's hope this series helps explain iterations and also the part they play in machine learning. Thanks for visiting.
For sure...
People think that machine learning is kind of difficult to understand; however, once you fully understand the iterations, everything becomes easier to understand.
Cheers!