Missing Steve Jobs!! - My first thought after watching "Microsoft Holographics"

Once in a while you get that "eye-popping, mind-boggling" moment on seeing something incredible (I would rather call it a phenomenon than a product). I felt that while watching Microsoft's Holographic promos. I believe the product is really, really "REVOLUTIONARY" and will literally open the gates to a new dimension for innovators. But wait, did you notice the way it was introduced, or rather shown, to us? "Disappointing" is a word I use very selectively, and I am sure I can use it here. What I see is the greatest thing of the century in the hands of a tasteless nerd. Let me take you to the future to foresee what might happen:

  • People might get bogged down by the geekiness and, in turn, use it only at the bare minimum, and to some extent only because they are forced to.
  • "It will not be for all", meaning it will still be like one of those x86 motherboards where you need some basic knowledge to use it.
  • It looks user-friendly, but I am damn sure it will not be user-friendly enough for wide acceptance.

Considering the above, the first thing that came to my mind was "What would Apple do if they had this idea?". To be precise, "What would Steve Jobs do if he had this idea?"

  • First things first ;-) he would give his team a lot more sleepless nights
  • An unbelievable, stylish and highly user-friendly product
  • An earth-shattering "Introduction" that literally every connected soul would have seen

Last but not least, he would make sure the product is sensible enough to be used by a variety of people. Obviously it would spawn many more copycats, but hey, the thing is here to stay.
One more thing: the product will be there for innovators to innovate on for the rest of the century. But "how fast" and "how big" depends on how it is unwrapped to the masses.

Google Refine Tutorial


Google Refine – power tool for working with messy data

Detailed Presentation can be viewed @Google Refine Tutorial

1 Introduction

Data cleansing means identifying the wrong or inaccurate records in a data set and making appropriate corrections to them. It involves identifying incomplete, inaccurate, and incorrect parts of the data and then either replacing them with correct data or deleting them. Data cleansing results in data that is consistent with other standard data and is useful for performing various analyses. Errors in the data can be due to data entry mistakes by the user, failures during transmission of data, or improper data definitions.
Google Refine is a web application, but unlike 99% of web applications, it is intended to be run on one's own machine and used by oneself. The server-side maintains states of the data (undo/redo history, long-running processes, etc.) while the client-side maintains states of the user interface (facets and their selections, view pagination, etc.). The client-side makes GET and POST ajax calls to cause changes to the data and to fetch data and data-related states from the server-side.

Google Refine is a powerful tool for cleansing data effectively. Its main features are
  • Pulling data from various sources
  • Cleaning the data using Transform/Clusters/Filters
  • Linking to web URLs to get more useful data
  • Connecting with various databases to reconcile the collected data

Some of the advantages of Google Refine are
  • Ease of use
  • Works in any browser
  • Extensive functionality
  • Undo/Redo is simply awesome

2 Installation


Google Refine is a desktop application in that you download it, install it, and run it on your own computer. However, unlike most other desktop applications, it runs as a small web server on your own computer and you point your web browser at that web server in order to use Refine. So, think of Refine as a personal and private web application.

Release Version
Install it as detailed below for your operating system.
As long as Google Refine is running, you can point your browser at http://127.0.0.1:3333/ to use it, and you can even use it in several browser tabs and windows.
Development Version (Advanced users who can build from source)
If you want the latest and greatest version, see How to get the development version.

Windows
Install: Once you have downloaded the .zip file, uncompress it into a folder wherever you want (such as in C:\Google-Refine).

Run: In that folder, run the .exe file. You should see the Command window in which Google Refine runs. By default, the Command window has a black background and text in a monospace font.

Shut down: When you need to shut down Google Refine, switch to that Command window, and press Ctrl-C. Wait until there's a message that says the shutdown is complete. That window might close automatically, or you can close it yourself. If you get asked, "Terminate all batch processes? Y/N", just press Y.

Mac OSX

Install: Once you have downloaded the .dmg file, open it, and drag the Google Refine icon into the Applications folder icon (just like you would normally install Mac applications).

Run: To launch Google Refine, go to the Applications folder and double click the Google Refine app. You'll see the Google Refine app appear in your dock.

Shut down: You can switch to the Google Refine app (clicking on its icon in the dock) and invoke its Quit command.

Linux

Install / Run: Once you have downloaded the tar.gz file, open a shell and type

  tar xzf google-refine.tar.gz
  cd google-refine
  ./refine
This will start Google Refine and open your browser to its starting page.

Shut down: Press Ctrl-C in the shell.

Running & Configuration
By default (and for security reasons) Refine only listens to TCP requests coming from localhost (127.0.0.1). If you want it to respond to TCP requests coming to any IP address the machine has, run Refine like this from the command line:

./refine -i 0.0.0.0
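
To check from a script whether anything is accepting connections on the default address, a plain TCP connection attempt is enough. The snippet below is only a sketch in Python: the function name `is_listening` is made up for illustration and is not part of Refine; the host and port defaults are the ones quoted above.

```python
import socket

def is_listening(host="127.0.0.1", port=3333, timeout=1.0):
    """Return True if something accepts TCP connections at host:port."""
    try:
        # create_connection raises OSError (refused, timed out, ...) on failure
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

if __name__ == "__main__":
    print("Refine reachable:", is_listening())
```

A successful connection only shows that some process is listening on that port; it does not prove that the process is Refine.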



3 Upgrading

You can upgrade from 1.1 to 2.0 by following the steps mentioned in the link below

4 Features



Some of the basic features of Google Refine include
  • Importing
The formats currently supported (in version 2.0) include:
a) TSV, CSV, or values separated by a custom separator you specify
b) Excel (.xls, .xlsx)
c) XML, RDF as XML
d) JSON
e) Google Spreadsheets
f) RDF N3 triples

Once imported, the data is stored in Google Refine's own format, and your original data file is left undisturbed.

  • Filtering
  • Editing:
    • Editing cells, editing cells by Clustering
    • Editing columns, creating columns by Extending data
    • Editing rows
    • Understanding expressions
    • Understanding regular expressions
  • Exporting
  • History (undo/redo)

5 Getting started

Let us get to know Google Refine in more detail by doing a simple project to cleanse some data. The data to be used for the Business Intelligence purpose is “Disasters worldwide from 1900-2008”. For a disaster to be entered into the database, it must meet at least one of the following criteria:

a) Ten (10) or more people reported killed.

b) Hundred (100) or more people reported affected.

c) Declaration of a state of emergency.

d) Call for international assistance.
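
The inclusion rule above can be written as a small predicate. This is only a sketch in Python: the function name and the parameter names (`killed`, `affected`, `emergency_declared`, `assistance_requested`) are hypothetical and do not come from the dataset itself.

```python
def qualifies_as_disaster(killed=0, affected=0,
                          emergency_declared=False,
                          assistance_requested=False):
    """Return True if an event meets at least one of the four criteria."""
    return bool(
        killed >= 10                # a) ten or more people reported killed
        or affected >= 100          # b) hundred or more people reported affected
        or emergency_declared       # c) declaration of a state of emergency
        or assistance_requested     # d) call for international assistance
    )

print(qualifies_as_disaster(killed=3, affected=250))
print(qualifies_as_disaster(killed=3, affected=40))
```

Meeting any single criterion is enough, which is why the conditions are combined with `or`.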

6 My First Project

Step 1: Opening a file

Step 2: Browsing the data

Step 3: Select the file and ‘CREATE PROJECT’

Step 4: All the projects are listed

Step 5: Project data in Google Refine

7 Transformation made easy /Clustering


Step 1: Transforming the Type of Calamity data. Click on Type --> Text --> Text Facet


Step 2: The total number of rows imported is shown. It also shows the total number of different choices available for that column (18).

Step 3: On looking closer, we can find redundancies and duplicates in the data.

Step 4: To eliminate them, we need to create a new column via Type --> Edit Column --> Add column based on this column

Step 5: On giving the data as mentioned above, we can reduce the number of choices.

So the number of choices came down from 18 to 15. This means the redundant values have been merged into a single type.
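
The effect of the steps above, fewer distinct choices after a clean-up, can be imitated outside Refine. The following Python sketch counts distinct values before and after a simple normalization. The sample values are invented, and the clean-up rule (trim spaces, title case) is an assumption made for illustration; it is not the expression used in the project.

```python
from collections import Counter

def facet(values):
    """Count occurrences of each distinct value, like a text facet."""
    return Counter(values)

def normalize(value):
    """Assumed clean-up rule: trim surrounding spaces and use title case."""
    return value.strip().title()

raw = ["Flood", "flood ", "Storm", "STORM", "Earthquake", "Drought"]
before = facet(raw)
after = facet(normalize(v) for v in raw)
print(len(before), "choices before,", len(after), "choices after")
```

Values that differed only in spacing or letter case now fall under the same choice, which is the same kind of reduction as the drop from 18 to 15 described above.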

Sometimes this is too complex to do manually. In that case Google Refine provides a CLUSTERING option to cluster data using different algorithms.

When selecting the “metaphone3” algorithm we get the data as follows

The “fingerprint” algorithm, on the other hand, is the strictest and safest. On selecting Merge, the 2 column values will be merged into one with the value given in “new cell value”.
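
To give a feel for why “fingerprint” is the strict option, here is a simplified Python sketch of the key-collision idea behind it: each value is reduced to a key, and values that share a key become merge candidates. This is an approximation written for illustration, not Refine's own code (which may do more, such as normalizing accented characters), and the sample values are invented.

```python
import string
from collections import defaultdict

def fingerprint(value):
    """Build a key: trim, lowercase, drop punctuation, sort unique tokens."""
    cleaned = value.strip().lower()
    cleaned = cleaned.translate(str.maketrans("", "", string.punctuation))
    tokens = sorted(set(cleaned.split()))
    return " ".join(tokens)

def cluster(values):
    """Group distinct values that share a fingerprint key."""
    groups = defaultdict(set)
    for value in values:
        groups[fingerprint(value)].add(value)
    # Only keys with more than one distinct spelling are merge candidates.
    return [sorted(group) for group in groups.values() if len(group) > 1]

sample = ["Mass movement wet", "mass movement wet ", "Wet Mass Movement",
          "Flood", "Storm"]
print(cluster(sample))
```

Because only case, spacing, punctuation and word order are ignored, two values end up in the same cluster only when they are very likely the same thing, which is why this method produces few false merges.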


8 Reconciliation

Reconciliation goes a step further than just cleansing the data: it fetches more information about the data from a freely available online database (Freebase).
Reconcile --> Start Reconciling…

It will take a couple of minutes to get connected and to get relevant information.

After a quick scan, the RECONCILE option suggests the most probable type (here, country), which we have to select to proceed further.


Now we can find a link with every row. Clicking that link leads to the online database.





9 Other Uses

Using Facebook data to find what they LIKED

And what they LIKED the least

TWITTER DATA to find Time Zones of each follower using Freebase


GOOGLE REFINE


“Water, water everywhere and not a drop to drink” may be an apt description of the humongous amount of data available to the current generation, which cannot be used for any useful purpose. The data available are huge in volume and, if we look carefully, we can find a lot of redundancy and inconsistency in them. While redundancy can be reduced at the design level, inconsistency in the data can be reduced to a large extent by using the GOOGLE REFINE tool. Clustering and other related BI features, together with an elegant UI, make it a powerful Business Intelligence tool. Its advantages are
  • Ease of use
  • Extensive functionality
  • Undo/Redo is simply awesome

Some of the disadvantages are

a) Looks more like a spreadsheet
b) Cannot do as many operations as a spreadsheet [only for high-end purposes]
c) Difficult to handle very large amounts of data


ARCHITECTURE

Google Refine is a web application, but unlike 99% of web applications, it is intended to be run on one's own machine and used by oneself. The server-side maintains states of the data (undo/redo history, long-running processes, etc.) while the client-side maintains states of the user interface (facets and their selections, view pagination, etc.). The client-side makes GET and POST ajax calls to cause changes to the data and to fetch data and data-related states from the server-side.



HOW TO GET Google Refine

GOOGLE!!! So it's free and can be easily downloaded from http://code.google.com/p/google-refine/ . It comes with instructions to install and use it on your PC.

CREATING A PROJECT

Just download the tool (Zip) and execute it. Then get some sample data for Business Intelligence and upload it to create a project. Clear instructions from Google at every step make creating a project a cakewalk.



TRANSFORMATION of Data made easy

Transformation is its prime weapon. Data can be transformed easily, as shown below, and Google Refine also offers lots of intuitive suggestions to make our life easier.




Text Facet Feature to Cluster data

Clustering of data is no longer a complex job done by an ETL tool. It can be done easily using the "Text Facet" feature of Refine, which shows results as and when a change is made, to ensure the accuracy of the transformation.





DATA FOR ANALYSIS

The data to be used for the Business Intelligence purpose is “Disasters worldwide from 1900-2008”. For a disaster to be entered into the database, it must meet at least one of the following criteria:

a) Ten (10) or more people reported killed.

b) Hundred (100) or more people reported affected.

c) Declaration of a state of emergency.

d) Call for international assistance.



This data is available at http://www.infochimps.com/datasets/disasters-worldwide-from-1900-2008 .


BUSINESS VALUE

This tool is used mainly for refining inconsistent data so that useful predictions can be made from it. We have pulled out the disaster database for the past 100 years and hope to use the acquired skills in analyzing the data

a) to find patterns in the occurrence of these major disasters
b) to predict the next big disaster and its location based on the available data
c) to analyze the frequency of each type of disaster by location
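
Goal c) amounts to a grouped count once the data is clean. A minimal Python sketch, assuming each cleaned row carries a country and a disaster type; the field names and the rows below are invented placeholders, not real records from the dataset.

```python
from collections import Counter

def frequency_by_location(rows):
    """Count disasters per (country, type) pair."""
    return Counter((row["country"], row["type"]) for row in rows)

rows = [
    {"country": "A", "type": "Flood"},
    {"country": "A", "type": "Flood"},
    {"country": "A", "type": "Storm"},
    {"country": "B", "type": "Flood"},
]
for (country, kind), count in sorted(frequency_by_location(rows).items()):
    print(country, kind, count)
```

Such a count is only meaningful after the type and country values have been cleaned, since otherwise different spellings of the same type would be counted separately.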
By,
Vijaya Prabhu (1oBM60097)
Sathishwaran (10BM60079)



How to file against a Retailer asking for more than MRP

Hi,

Recently I had a fight with a shop named "BIMALA SWEETS" in Tech Market, IIT Kharagpur. The shop owner asked for more than the MRP for a cool drink, saying the extra amount was for COOLING :-) :-).

This may appear normal to any Indian. But it is illegal: they are actually looting you, and they will not give any receipts either. I got fed up with this and, as an educated individual, wanted to file a complaint against them.

The following are my findings

STEP 1)



Indian Laws: Conditions for Charging above MRP

If a retailer or service provider charges you more than MRP while providing home delivery service, do not pay him the extra money. Remember that if you are ordering stuff at home, you are not utilizing any service of the service provider. The National Consumer Disputes Redressal Commission has ordered that consumers cannot be charged more than the MRP during home delivery services as they are not utilizing any service. An extra amount can only be charged in restaurants where additional services are provided. So, consumers should be aware of their rights under the Consumer Protection Act, 1986. These rights also enable the consumer to get back the extra money, along with the compensation.
Under the Indian Laws, the following provides consumer protection:
  • Central Consumer Protection Council
  • State Consumer Protection Council
  • District Consumer Information Centre
  • District Consumer Protection Council
  • Consumer Complaint Cells of Business Chambers & Traders’ Associations

Indian Laws: How to File a complaint?

File a complaint on plain paper, in triplicate, with the District Consumer Disputes Redressal Forum at the seller’s place, within 2 years from the date of the cause of action.
You must mention the following information in the application form:
  • Your name along with address
  • Name along with address of the party against whom the complaint is being filed
  • Description of goods or services
  • The price charged
  • Quantity purchased by you
  • The date of purchase
  • Any proof available of purchase
  • Required type of redressal
[ courtesy : http://www.lawisgreek.com/purchasing-products-within-mrp/ ]

STEP 2)
Then go to the site http://www.ncdrc.nic.in/ and select "District forums" to find the address of the office, and file your complaint about the issue there.

It will be addressed within 90 days. The government has all the facilities; it is up to us to claim our rights.

-VP

DREAM LAND!! Is it really what it is meant to be??

Hey there, this blog post is about an incident that happened in a hotel, DREAMLAND, inside IIT Kharagpur. It is one of the hotels where IITians hang out to escape from the Mess food. Now let's get to the story.

I was having a pretty bad day roaming outside IIT in search of a decent hotel. It was in vain, and I finally chose to come back into IIT to have lunch. Then I thought it was better to go back to the Mess than to eat in any of the hotels, where the taste is pretty much the same. But since I had set out to go to a hotel in the first place, my heart didn't allow me to pass that hotel without treating my taste buds. So I stopped my Karizma (cycle) there and went in. I was welcomed by a guy at the counter and I ordered an "Amul Cool". The guy picked it from the fridge, opened it and kept it in front of me. Then I remembered that I had only a Rs.500 note and no change, so I asked whether he had any change. He didn't even look at me and started searching for something in the dustbin near the counter. I thought he hadn't heard me, so I tried to convey it in the best way I could: "ACTING OUT". Then I saw a 100 watt light in his face. I thought he had understood what I said, but he slowly picked up the bottle cap he had found in the dustbin and covered the opened "Amul Cool" with it.

I just wanted to post this to spread awareness, so that people know about the products they buy and, most importantly, from whom :-).