This blog aims to explain the process of launching a web server using a containerization technology (Docker) & a DevOps automation tool (Ansible)!

Automation is being widely adopted & is in huge demand these days for multiple reasons, & DevOps is the key to achieving it. Automation enables work to run in parallel to a very large extent, saves time that can be spent on other tasks, is less prone to errors, & saves money.

Automation has firmly established itself: everyone is chasing it & the demand for it is huge, which is why I am writing this blog to provide a glimpse of the automation world.

Before jumping directly to the practical, let us briefly discuss the Containerization & Automation tools that are used in it. …


This blog aims to explain the difference between Probability & Likelihood. This topic is very important, but the two terms are easy to confuse. That is why I am writing this blog: to remove the confusion by explaining both in as simple a manner as possible.

I am quite confident that you have encountered the terms “Probability” & “Likelihood” in your daily life, & that you found them confusing & almost identical. To anyone trying to understand these terms for the very first time, they can feel like the same thing, & it is difficult to spot the difference between them.

No worries, you have come to the right place: this blog will guide you through the difference between Probability & Likelihood.

Important note!

The biggest obstacle to understanding the concepts of Data Science is a wrong approach towards learning them. …


This blog aims to explain an effective way to calculate the correlation between the features of a dataset, which in turn helps not only to select specific features for model training (reducing the curse of dimensionality) but also to improve model performance.

In every Data Science project, Feature Engineering is a very important step in building an effective model. In particular, it is very important to select the smallest set of features that are relevant to the target variable/output.

There are various techniques for Feature Selection; among them, finding the correlation between the features of the dataset is very popular & widely adopted.

I would request all readers of this blog to please read my blog on Covariance (if you haven’t already). It will build your fundamentals on correlation, & it will also help you understand the drawbacks of Covariance that lead us to use Pearson Correlation. …
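
To make the idea concrete, here is a minimal sketch of correlation-based feature selection in plain Python. It is only an illustration under my own assumptions: the toy dataset, the feature names & the 0.8 cut-off are all hypothetical, & the Pearson correlation is computed by hand as the covariance divided by the product of the two standard deviations.

```python
import math


def covariance(xs, ys):
    """Sample covariance of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


def pearson(xs, ys):
    """Pearson correlation: covariance rescaled by both standard deviations."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))


# Hypothetical toy dataset: three candidate features & one target.
features = {
    "area_sqft": [500, 750, 1000, 1250, 1500],
    "rooms": [1, 2, 2, 3, 4],
    "door_number": [7, 3, 9, 1, 5],
}
price = [50, 74, 101, 126, 149]

# Correlation of every feature with the target.
scores = {name: pearson(values, price) for name, values in features.items()}

# Keep only features whose absolute correlation passes an assumed threshold.
THRESHOLD = 0.8
selected = [name for name, r in scores.items() if abs(r) >= THRESHOLD]

for name, r in scores.items():
    print(f"{name}: r = {r:.3f}")
print("selected:", selected)
```

Unlike Covariance, the value always lies between -1 & 1, so features measured in different units can be compared against one threshold. The same function can also be applied to pairs of features to spot redundant ones.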


This blog aims to explain Covariance, which is a very important topic in Feature Engineering in Data Science. In addition, it covers the use-cases, advantages & disadvantages of Covariance.

Data Science is a very hot topic at present. Many students are choosing this field as their profession, & many working professionals are also moving into it after seeing its scope.

Because Data Science is so popular, it is attracting a lot of people, which is an amazing thing. In contrast, however, most people who start learning in this field feel the urge to learn it as quickly as possible. …


This blog aims to implement a high-availability web server architecture on AWS using the AWS CLI. Additional EBS storage will be used to make the web server's data persistent, the S3 service will be used to store the static objects for the web server, & CloudFront will be used for CDN services.

For any business, high availability is a core requirement, irrespective of the type of business: if the business has to grow, its products must be available to customers.

In addition to availability, there is one more factor: how fast can the services of the business be accessed? This factor also plays a very important role in business growth in today’s fast-moving world.

For example, consider the e-commerce company “Flipkart”. If Flipkart’s site is not available, Flipkart cannot grow; it may even vanish from the e-commerce market within a few weeks. Now, coming to speed, consider a case in which the Flipkart site takes too long to load. In this case too, Flipkart’s customers will stop visiting the website because they will be annoyed by the loading latency. …


This blog aims to explain the very famous encryption technique RSA, covering related topics such as asymmetric encryption, block ciphers, & public & private keys.

Everyone is concerned about security today because, in this digital world where everything is online, the chances of fraud are very high. Most organizations today offer specialist security jobs because they want the utmost security for their data.

In order to learn advanced security concepts, everyone should be familiar with the basic cryptography algorithms, & RSA is one of them.

Brief about Asymmetric Encryption!

It is an encryption technique that involves 2 keys, one public & one private. In asymmetric encryption, the public key is available to everyone, but the private key is available only to the person/machine that has to decode the message. Since the decryption key never has to be shared, it is often considered safer for exchanging messages than symmetric encryption, where a single shared secret key is used for both encryption & decryption. …
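
The two-key idea can be sketched with a toy RSA example in Python. This is only an illustration under my own assumptions: the primes, the public exponent & the message are hypothetical values that are far too small for real use, & real systems also add padding, which is left out here.

```python
import math

# Toy primes, far too small for real use (illustrative assumption).
p, q = 61, 53
n = p * q                    # modulus, shared by both keys
phi = (p - 1) * (q - 1)      # Euler's totient of n

e = 17                       # public exponent, must be coprime with phi
assert math.gcd(e, phi) == 1
d = pow(e, -1, phi)          # private exponent: modular inverse of e (Python 3.8+)

public_key = (e, n)          # can be given to everyone
private_key = (d, n)         # kept only by the receiver


def encrypt(message, key):
    """Anyone holding the public key can encrypt."""
    exponent, modulus = key
    return pow(message, exponent, modulus)


def decrypt(ciphertext, key):
    """Only the holder of the private key can decrypt."""
    exponent, modulus = key
    return pow(ciphertext, exponent, modulus)


message = 65                 # must be an integer smaller than n
ciphertext = encrypt(message, public_key)
recovered = decrypt(ciphertext, private_key)

print("ciphertext:", ciphertext)
print("recovered :", recovered)
```

The sender only ever needs `public_key`, so the key that can decode the message never has to travel over the network.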


This blog aims to explain two of the most confusing concepts in feature engineering: Standardization & Normalization. Both look very similar, & most people fail to understand the difference between them & the use-case for each. But no worries, this blog will act as a helping hand to make the difference between them & their use-cases clear.

It's completely fine if you feel confused between “Standardization” & “Normalization”. A few months ago, I was one of you, so I completely understand this feeling of confusion, & sometimes frustration too, because there was no good & easy resource that explained the topic.

But there is no need to worry, because this blog will not only clear all the doubts about these topics but also provide their use-cases, i.e. when to use which.

Important Prior Knowledge!

Before explaining the difference between “Standardization” & “Normalization”, let me build the context for that.

Standardization & Normalization are both part of Feature Engineering, which in turn is a part of Data Science.
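
As a quick preview, here is a minimal sketch of the two techniques in plain Python. It assumes the most common definitions, which are my assumption at this point in the blog: Standardization as z-score scaling (subtract the mean, divide by the standard deviation) & Normalization as min-max scaling to the range 0 to 1. The `ages` column is a hypothetical example feature.

```python
import statistics


def standardize(values):
    """Z-score scaling: the result has mean 0 & standard deviation 1."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)   # population standard deviation
    return [(v - mean) / std for v in values]


def normalize(values):
    """Min-max scaling: the result lies between 0 & 1."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical feature column.
ages = [20, 30, 40, 50, 60]

standardized = standardize(ages)
normalized = normalize(ages)

print("standardized:", [round(v, 3) for v in standardized])
print("normalized  :", normalized)
```

Both functions assume the column is not constant; otherwise the standard deviation or the range is 0 & the division fails.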


This blog aims to explain two fundamentals of Machine Learning: Bias & Variance. Understanding these terms is very important for future learning in this field, because much of machine learning builds on them.

I am pretty sure that most of you have heard these terms, but you may be confused by them because they also appear in other fields such as statistics.

How are these terms different in Machine Learning, & what is their role in it? If you have these questions, or even if you have not heard these terms at all, don’t worry: this blog will guide you to the exact meaning of these terms with practical & diagrammatic explanations.

Introduction to Bias & Variance!

Both of these terms are best understood by visualizing a graph. Let’s take the example of a very easy Machine Learning algorithm, Linear Regression: whenever its fit is plotted, both Bias & Variance can be observed, & the model's ability to generalize to future data is judged from them. …


This blog aims to provide information & case studies about companies that have benefited hugely from using AI/ML.

We live in a world surrounded by AI devices; even the camera of a phone uses AI. Many devices available in the market today use Machine Learning so that their products can learn the trends of today’s generation.

Even motorbikes today come with Machine Learning features. Self-driving cars are another example where AI plays a role.

Wherever you look around you, you will most probably find some device that uses AI or ML. …


This blog aims to explain the process of setting up a Hadoop version 1 multi-node cluster. After going through this blog, anyone will be able to deploy a Hadoop v1 multi-node cluster effectively, with all the resources needed for it to function properly.

In this Big Data world, using a distributed filesystem is a must. With such huge volumes of data, it is very difficult to cope without the help of a distributed filesystem.

Brief about Hadoop!

Hadoop, with its distributed filesystem HDFS, is one of the most popular distributed storage platforms, & it is widely used by many companies today. The hard reality is that although Hadoop is used at many companies, there is no proper article that explains the exact procedure for installing it. Moreover, many people, even teachers in colleges, say that if you already get Hadoop installed through distributions like Cloudera or Hortonworks, then you do not need to know the installation process. …

About

Harshit Dawar

Big Data enthusiast with a demonstrated history of delivering large and complex projects. Interested in working in the field of AI and Data Science.
