Stories hidden in the data

A naive bayes classification example

I have collected Forbes’ 2017 billionaire profiles (2043 articles) and new York times' notable deaths 2016 (364 articles). These articles are about individuals. Here I am developing a simple machine learning program for classifying the articles based on Naive Bayes classifier using python/nltk. Let us use...

Machine Learning that Matters

A summary of the paper Machine Learning that Matters by Kiri L. Wagstaff (2012). The paper express concern over the ML research detached from the larger world of science and...

Perwad English Corpus

There are thirteen million words from two sets of text, top books from Project-Gutenberg and Forbes' billionaire profiles, in PEC currently. Here is the statistics. Vocabulary size Share in written English by Oxford by Perwad 10 25% 26.1%...

Most common words in English

Once there lived a poor woodcutter. He used to cut trees in the woods. One day he was cutting wood on the bank of a river. His axe fell down into the river. The river was deep. He could not take his axe out. He sat on the bank...

Assembly programming is worth learning?

C language was my favorite programming tool during my undergraduate and postgraduate. I was keen on optimizing c-programs for speed. The quest for speed drove me to the world of assembly programming. Assembly is the only language that will support to produce highly optimized code. I felt I found the...

Show, don't tell: The first rule of writing

I usually pay the bill before having tea from tea shop. I fear I may leave without paying. Some shopkeepers will tell: "Have it first, pay next." Some other shopkeepers will give a lecture on their service. I can't stand for such mokka1 dialogues. That will make me wrinkle...

Improving the Design of Code after Written

I am basically a PHP programmer; always writing code in procedural way. A PHP developer won't get much chance to apply object oriented concepts; the server side functionality will be simpler or most of the classes will be already available in the PHP framework he is using. A few months...

Skepticism on Software Requirement

I am always skeptical about everything which I cannot understand or I am not familiar with. It often leads me to troubles. In such troubles, I realizes I should speak or take decision based on the proper understanding of the things. Once one of my colleague was worried about the...

Page Views Count

Page views count is the count of number of times a webpage has been viewed. It can be used to list the most read articles. Users may increase page views to make their article most read. Such...