April 22, 2015

Mapping Money Movements to Trap Corruption

In the chequered history of parliamentary legislation in India, the RTI Act stands out as a significant milestone that puts the activities of the government under public scrutiny. But even though the Act gives citizens a legitimate platform on which to ask questions, the process is cumbersome and answers are often given in a manner that is not easy to understand or make use of.

But why must a citizen have to ask for something that is his by birthright? Why can the information not be released automatically? But then who decides what information is to be released? At what level of detail? At what frequency?

The biggest challenge facing India is corruption. It is the mother of all problems because it leads to and exacerbates all other problems. If controlled, the money saved can be used to address most deficiencies in health, education and other social sectors.

Misguided people wrongly believe that having a strong Lokpal will solve the problem, but when bodies as powerful as the CBI and the CVC have been subverted and compromised by political interests, it is foolish to expect one more publicly funded body, like the Lokpal, to be any better. Instead, let us explore how a crowd-sourced and data-driven approach can help both track and crack the problem.

The central and state governments in India, between them, spend about Rs 25 lakh crores every year. Even if, very optimistically, only 10% of this is lost in corruption, then the presumptive loss to the public exchequer is Rs 2.5 lakh crores every year, compared to the one-time loss of Rs 1.8 lakh crores in the CoalGate scam.

Can we follow the clear stream of public money as it slowly gets lost in the dreary desert sands of government corruption? Most public money starts flowing from the commanding heights of the central government and passes through a complex hierarchy of state and central government departments, municipalities, zilla parishads, panchayats until it reaches the intended beneficiary, who could be a citizen, an employee or a contractor. Flows also begin from money collected as local taxes and routed through similar channels. The accompanying diagram gives a rough idea of this process.

In the language of mathematics, this is a graph consisting of a collection of nodes connected to each other through directed edges. Each node represents a government agency or public body and each directed edge represents a flow of money. Inward-pointing edges mean that the node, or agency, receives money while outward-pointing edges represent payments made. Green nodes are sources where money enters -- this could be tax, government borrowings or even the RBI printing notes -- while yellow nodes are end-use destinations of public money -- salaries, contractor payments, interest payments and direct benefits transferred to citizens.

Ideally, the flow of money through this network should be such that, over a period of time, the total amount that flows into the network at all green nodes equals the total that flows out at all yellow nodes. In reality the sums will never add up because there is significant leakage, or theft, in the network. The money lost in transit within the graph between green and yellow nodes is one quantifiable measure of corruption. Another, less obvious, signal is an inexplicable or unusually high flow of money -- as in the case of the Chaibasa Treasury at the height of the Fodder Scam, or a sudden spurt of expenditure on widening all roads inside a particular IIT campus. Any deviation from norms, either historical or from similar expenditure elsewhere, needs an investigation and explanation.
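To make the leakage measure concrete, here is a minimal sketch in Python. The agency names, the amounts and the helper `network_leakage` are all invented for illustration; nothing here is real data. It simply compares what enters at green nodes with what arrives at yellow nodes.

```python
# Hypothetical flows, in crores of rupees: (payer, payee, amount).
# All names and figures are made up purely for illustration.
FLOWS = [
    ("TAX", "CENTRE", 1000),
    ("CENTRE", "STATE", 600),
    ("CENTRE", "SALARIES", 380),
    ("STATE", "PANCHAYAT", 300),
    ("STATE", "CONTRACTORS", 250),
    ("PANCHAYAT", "BENEFICIARIES", 240),
]
GREEN = {"TAX"}                                         # sources of money
YELLOW = {"SALARIES", "CONTRACTORS", "BENEFICIARIES"}   # end uses


def network_leakage(flows, green, yellow):
    """Money that entered at green nodes but never reached a yellow node."""
    entered = sum(amount for src, _, amount in flows if src in green)
    delivered = sum(amount for _, dst, amount in flows if dst in yellow)
    return entered - delivered


if __name__ == "__main__":
    print("Unaccounted money (crores):", network_leakage(FLOWS, GREEN, YELLOW))
```

In this toy network 1000 crores enter and 870 crores reach end users, so 130 crores are unaccounted for somewhere in between.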

Can such anomalies and deviations be detected? And how does the RTI Act fit into this picture?

This graph is large, the number of nodes and edges is very high and the problem seems insurmountable. But we can divide and conquer the problem because many tasks can be done in parallel by independent groups. Instead of focussing on the entire graph at one go, we can zoom in on certain segments of the graph and examine specific nodes at higher magnification or smaller granularity. In principle, if the flow through every node is accounted for, then the flow through the whole graph gets accounted for automatically.
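The divide-and-conquer idea can be sketched as a per-node check that independent groups could run on their own segment of the graph. The data and the function `node_imbalances` are hypothetical. For every intermediate agency the sketch compares money received with money paid out, and the per-node gaps add up to the gap for the whole network.

```python
from collections import defaultdict

# Hypothetical flows, in crores of rupees: (payer, payee, amount).
FLOWS = [
    ("TAX", "CENTRE", 1000),
    ("CENTRE", "STATE", 600),
    ("CENTRE", "SALARIES", 380),
    ("STATE", "PANCHAYAT", 300),
    ("STATE", "CONTRACTORS", 250),
    ("PANCHAYAT", "BENEFICIARIES", 240),
]
GREEN = {"TAX"}
YELLOW = {"SALARIES", "CONTRACTORS", "BENEFICIARIES"}


def node_imbalances(flows, green, yellow):
    """Return {agency: inflow - outflow} for intermediate nodes that do not balance."""
    inflow = defaultdict(int)
    outflow = defaultdict(int)
    for src, dst, amount in flows:
        outflow[src] += amount
        inflow[dst] += amount
    # Only intermediate agencies are expected to pass on everything they receive.
    intermediate = (set(inflow) | set(outflow)) - green - yellow
    return {
        node: inflow[node] - outflow[node]
        for node in sorted(intermediate)
        if inflow[node] != outflow[node]
    }


if __name__ == "__main__":
    gaps = node_imbalances(FLOWS, GREEN, YELLOW)
    for agency, gap in gaps.items():
        print(f"{agency}: {gap} crores unaccounted for")
    print("Total:", sum(gaps.values()))
```

Each agency can be examined by a different group; once every node balances, the total gap is zero by construction.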

How do we conquer each node?

Since each node is a government organisation, it falls under the RTI Act and we can demand details of all its cash flows. Once this is in the public domain it can be examined by private volunteer investigators either manually or by automated software specifically designed for forensic audit. In fact if Google search bots, or software robots, can crawl through the entire web to track down, rate and index trillions of unstructured web pages, it would not be difficult to build software that can track down and reconcile each and every cash flow transaction in India provided the data is publicly available in a digital format.
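One simple check that such audit software might run is a comparison of an agency's latest spending with its own history. The sketch below is only an illustration: the figures are invented, and the three-standard-deviation threshold is an assumption rather than an established audit norm.

```python
from statistics import mean, stdev


def is_unusual(history, current, threshold=3.0):
    """Flag `current` if it lies more than `threshold` sample standard
    deviations away from the mean of `history` (needs at least two values)."""
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # A perfectly flat history: any change at all is a deviation.
        return current != mu
    return abs(current - mu) / sigma > threshold


# Hypothetical quarterly spending of one treasury, in crores of rupees.
HISTORY = [10, 12, 11, 9, 10, 11, 12, 10]

if __name__ == "__main__":
    for amount in (12, 40):
        verdict = "needs explanation" if is_unusual(HISTORY, amount) else "within norms"
        print(amount, verdict)
```

A flag raised this way is not proof of wrongdoing; it only marks a flow that deserves a closer look or a focussed RTI request.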

The results of such investigations and the unbalanced flows that are revealed should also be in the public domain and would be the starting point of either a formal CAG directed audit or citizen activism directed at the agency concerned. If the cash flows of a particular panchayat or a government agency do not add up or seem suspicious, affected parties should take this up locally either through their elected representatives or through more specific and focussed RTI requests.

This may seem complicated but is not really so. All that we are demanding is that bodies that deal with public money should publish their financial accounts in the public domain in a standardised format. Obviously the accounting format specified for listed companies may not be appropriate since assets and liabilities are accounted for differently and there is no question of profit or loss for public bodies. Instead, the focus is on the cash flow statement. Specifically, how much money is coming in, and where is it going?

How will this actually work in practice?

First, the CAG and the Institute of Chartered Accountants of India will create a format to report all cash flows in public bodies in terms of a nationally consistent set of cost codes and charge accounts. Next the CIC will mandate that all public bodies must upload this information every quarter to a public website maintained by the CIC. As this information accumulates online, volunteer auditors, public activists, anti-corruption campaigners and even the CAG itself, if it wants to, can collaboratively build a website, like Wikipedia or Wikimapia, that pulls data from the underlying CIC website and displays the cash flow graph. Even as the graph gets built, people can start looking for missing or unusual flows.

So there is no persistent workload on either the CAG or the CIC. However each public body must prepare its cash flow statement and put it on the CIC website. In any case, a public body is expected to keep a record of all cash flows -- through cash, cheque or EFT. This needs to be put into the CAG-designated format and uploaded periodically to the CIC website.

In fact the CIC website should already be ready because in October 2014, the DoPT announced that henceforth all replies under the RTI Act will be uploaded to the web in any case!
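As a rough illustration of what a standardised upload could make possible, the sketch below reconciles payments reported by one agency against receipts reported by its counterparty. The column names, cost codes and figures are assumptions made up for this example; the actual format would be whatever the CAG defines.

```python
import csv
import io

# Hypothetical quarterly uploads in an assumed CSV layout.
SAMPLE = """agency,quarter,direction,counterparty,cost_code,amount
STATE,2015Q1,OUT,PANCHAYAT,ROADS,300
PANCHAYAT,2015Q1,IN,STATE,ROADS,280
STATE,2015Q1,OUT,MUNICIPALITY,WATER,150
MUNICIPALITY,2015Q1,IN,STATE,WATER,150
"""


def unreconciled(csv_text):
    """Return flows where the payer's figure differs from the payee's figure.

    Keys are (payer, payee, quarter, cost_code); values are (paid, received).
    """
    paid, received = {}, {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        amount = int(row["amount"])
        if row["direction"] == "OUT":
            key = (row["agency"], row["counterparty"], row["quarter"], row["cost_code"])
            paid[key] = paid.get(key, 0) + amount
        else:
            key = (row["counterparty"], row["agency"], row["quarter"], row["cost_code"])
            received[key] = received.get(key, 0) + amount
    return {
        key: (paid.get(key, 0), received.get(key, 0))
        for key in paid.keys() | received.keys()
        if paid.get(key, 0) != received.get(key, 0)
    }


if __name__ == "__main__":
    for key, (out_amt, in_amt) in unreconciled(SAMPLE).items():
        print(key, "paid", out_amt, "received", in_amt)
```

Here the state reports paying 300 crores for roads while the panchayat reports receiving only 280, which is exactly the kind of mismatch a volunteer auditor would follow up.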

In the initial stages, the graph will have major discrepancies in cash flow and this could be because all agencies, or nodes, have not been identified or are not reporting information. At this point, activists can put in specific RTI requests to the concerned agencies to report their information in the CAG format and the CIC must ensure immediate compliance with the same.  Over a period of time and with a number of iterations the cash flows through all parts of the network should first become visible and then balance out. Compliance will be achieved by continuous RTI pressure at the grassroots, applied in a parallel and distributed manner, on defaulting government agencies.

When known outflows do not match inflows or if there are deviations from expected norms then activists can draw the attention of the media and opposition politicians who can raise a hue and cry. This should lead to the usual process of investigation where the CVC, the CBI or even the Supreme Court can get involved. After all, Al Capone, the notorious US gangster, was finally convicted on the basis of a forensic audit, not a shoot-out police operation!

Unlike the top-down Lokpal driven approach, this bottom-up strategy calls for no new law, no new agency, no new technology, no new infrastructure. All it needs is a format to be defined by the CAG and for the CIC to ensure that any RTI demand in this format be addressed immediately. With this, the flow of public money will become visible and if this transparency leads to a reduction of only 10% of the presumptive loss of Rs 2.5 lakh crores, it would still mean an additional Rs 25,000 crore for the Indian public every year.
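The arithmetic behind these figures can be checked in a few lines. This is only a restatement of the numbers quoted in the article, with one lakh crore written as 100,000 crores; the variable names are illustrative.

```python
LAKH_CRORE = 100_000  # one lakh crore, expressed in crores of rupees

annual_spend = 25 * LAKH_CRORE               # Rs 25 lakh crores spent every year
presumptive_loss = annual_spend * 10 // 100  # 10% assumed lost to corruption
recovered = presumptive_loss * 10 // 100     # 10% of that loss prevented

if __name__ == "__main__":
    print("Presumptive loss (crores):", presumptive_loss)
    print("Recovered through transparency (crores):", recovered)
```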

19th century British India had the vision, the audacity and the tenacity to carry out the Great Trigonometrical Survey that created the first comprehensive map (see diagram) necessary to govern this vast country. Armed with the digital technology of the 21st century, a similar mapping of all public cash flows will lead to greater transparency in the governance of modern India.

A high-resolution version of this image is available from Wikimedia.

This article first appeared in Swarajya -- the magazine that helps India think Right

April 17, 2015

Big Data for the non-Geek : Hadoop, Hortonworks & H2O

Hadoop is a conceptual delight and an architectural marvel. Anybody who understands the immense challenge of crunching through a humungous amount of data will appreciate the way it transparently distributes the workload across multiple computers and marvel at the elegance with which it does so.

image from nextgendistribution.com
Thirty years after my first tryst with data -- as relational database management systems that I had come across at the University of Texas at Dallas -- my introduction to Hadoop was an eye opener into a whole new world of data processing. Last summer, I managed to Demystify Map Reduce and Hadoop by installing it on Ubuntu and running a few Java programs but frankly I was more comfortable with Pig and Hive that allowed a non-Java person -- or pre-Java dinosaur -- like me to perform meaningful tasks with Map-Reduce. RHadoop should have helped bridge the gap between R based Data Science (or Business Analytics) and the world of Hadoop but integrating the two was a tough challenge and I had to settle for the Hadoop streaming API as a means of executing predictive statistics tasks like linear regression with R in a Hadoop environment.

The challenge with Hadoop is that it is meant for men, not boys. It does not run on Windows -- the OS that boys like to play with -- nor does it have a "cute" GUI interface. You need to key in mile long unix commands to get the job done and while this gives you good bragging points in the geek community, all that you can expect from the MBAwallahs and the middle managers at India's famous "IT companies" is nothing more than a shrug and a cold shoulder.

But the trouble is Hadoop (and Map Reduce) is important. It is an order of magnitude more important than all the Datawarehousing and Business Intelligence that IT managers in India have been talking about for the past fifteen years. So how do we get these little boys (who think they are hairy chested adult men -- few women have any interest in such matters) to try out Hadoop?

Enter Hortonworks, Hue and H2O (seriously!) as a panacea to all problems.

First, you stay with your beloved Windows (7 or 8) and install a virtual machine software. In my case, I chose and downloaded Oracle VM VirtualBox that I can use for free, and installed it on my Windows 7 partition. ( I have an Ubuntu 14.04 partition where I do some real work). A quick double-click on the downloaded software is enough to get it up and running.

Next I downloaded the Hortonworks Data Platform (HDP) sandbox as an Oracle Virtual Appliance and imported it into the Oracle VM VirtualBox ( or this) and what did I get? A completely configured single node Hadoop cluster along with Pig, Hive and a host of applications that constitute the Hadoop ecosystem! And all this in just about 15 minutes -- unbelievable!

Is this for real? Yes it is!

For example the same Java WordCount program that I had compiled and executed last year, worked perfectly with the same shell script that I modified with the corresponding Hadoop libraries present in the sandbox.

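# for compiling: start from a clean directory for the class files
rm -rf WCclasses
mkdir WCclasses
# the classpath is as given in the original post; it must point at the Hadoop libraries in the sandbox
javac -classpath /usr/hdp/ -d WCclasses WordMapper.java
javac -classpath /usr/hdp/ -d WCclasses SumReducer.java
javac -classpath /usr/hdp/ -d WCclasses WordCount.java
# package the compiled classes into a single jar
jar -cvf WordCount.jar -C WCclasses/ .

# for executing/running
# remove the output of any previous run; Hadoop will not overwrite an existing output directory
hdfs dfs -rm -r /user/ru1/wc-out2
# confirm that the home and input directories are in place
hdfs dfs -ls /user/ru1
hdfs dfs -ls /user/ru1/wc-inp
# run the job: input directory first, output directory second
hadoop jar WordCount.jar WordCount /user/ru1/wc-inp /user/ru1/wc-out2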
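The post does not reproduce WordMapper.java or SumReducer.java, so as a hedged illustration of what a word count job of this kind typically does, here is the same map, shuffle and reduce logic as a plain Python sketch that runs on a single machine. The function names mirror the Java class names only by assumption, and nothing here calls Hadoop itself.

```python
from collections import defaultdict


def word_mapper(line):
    """Map step: emit a (word, 1) pair for every word in a line of text."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by key, as the framework would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def sum_reducer(word, counts):
    """Reduce step: add up the counts collected for one word."""
    return word, sum(counts)


def word_count(lines):
    pairs = (pair for line in lines for pair in word_mapper(line))
    return dict(sum_reducer(word, counts) for word, counts in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["the quick fox", "The lazy dog the end"]))
```

On a real cluster the map and reduce steps run on different machines and the shuffle happens over the network, which is what lets the same logic scale to very large inputs.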

But what really takes the cake is that all data movement from the Unix file system to the HDFS file system is through upload/download through the browser. Not the dfs command, -copyFromLocal! This is due to the magic of Hue, a free and open-source product that gives a GUI to the Hadoop ecosystem. Hue can be installed, like Pig or Hive, on any Hadoop system but in the Hortonworks sandbox, it comes pre-installed.

In fact, since Pig and Hive come bundled in the sandbox, it is very simple to run Pig and Hive programs by following these tutorials.

But as a data scientist, one must be able to run R programs with Hadoop. To do so, follow instructions given here. But there are some deviations:
  1. You log into the sandbox ( or rather ssh -root@ -p 2222) and install RStudio server, but it will NOT be visible at port 8787 as promised until you follow instructions given in this post. Port 8787 in the guest operating system must be made visible to the host operating system.
  2. You can start with installing the packages rmr2 and rhdfs. The other two, rhbase and plyrmr are not really necessary to get started with. Also devtools is not really required. Just use wget to pull the latest zip files and install the same.
  3. However RHadoop will NOT WORK unless the packages are installed in the common library and NOT in the personal library of the userid used to install the packages. See the solution to problem given in this post. This means that the entire installation must be made with the root login. Even this is a challenge because when you use the sudo command, the environment variable CMD_STREAMING is not available and without it the package rhdfs cannot be installed. This Catch-22 can be overcome by installing without the sudo command BUT giving write privilege to all on the system library, which would be something like /usr/lib64/R/library.
  4. RStudio server would need a non-system, non-admin userid to access and use.
Once you get past all this, you should be able to run not only the simple R+Hadoop program given in this post, but also the Hortonworks R Hadoop tutorial that uses linear regression to predict website visitors.

R and Hadoop is fine but converting a standard machine learning algorithm to make it work in the Map Reduce format is not easy, and this is where we use H2O, a remarkable open source product that allows standard machine learning tasks like Linear Regression, Decision Trees, K-Means to be performed through a GUI interface. H2O runs on Windows or Unix as a server that is accessed through a browser at localhost:54321. To install it on the Hortonworks HDP sandbox in the Oracle VM VirtualBox, follow instructions given in this Hortonworks+H2O tutorial.

In this case, you will (or might) face these problems:
  1. The server may not be available at the principal IP address but at localhost instead.
  2. The server may not become visible until you configure the VM to forward ports as explained in the solution given in this post.
Once you have H2O configured with Hadoop on the sandbox, then all normal machine learning tasks should be automatically ported to the map-reduce format and can benefit from the scalability of Hadoop.

So what have we achieved ? We have ...
  1. Hadoop, the quintessential product to address big data solutions
  2. Hortonworks that eliminates the problems associated with installing a complex product like Hadoop, plus its associated products like Pig, Hive, Mahout etc.
  3. The Oracle VM VirtualBox that allows us to install Hortonworks in a Windows environment
  4. Hue that gives a GUI to interact with the Hadoop ecosystem, without having to remember long and complex Unix commands.
  5. RHadoop that allows us to use RStudio to run Map Reduce programs
  6. H2O that allows us to run complete machine learning jobs, either in stand alone mode or as Map Reduce jobs in Hadoop through an intuitive GUI.
If you think this is too geeky, then think again if you really want to get into data science! In reality, once we get past the initial installation, the rest is very much GUI driven and the data scientist may just feel that he (or she) is back in the never-never land of menu driven software where you enter values and press shiny buttons. But behind those buttons you would be running heavy duty Hadoop programs that, in principle, can crunch through terabytes of data.

March 21, 2015

Why Not Vote Through ATMs?

The Chief Election Commissioner has recently stated that the EC is planning to use web and mobile based technology to allow citizens to cast their vote in local and national elections. Is this feasible? Will it be fair? Or secure? The challenge is indeed daunting but let us see how we can leverage an existing technology infrastructure to reach this goal easily and at a very low cost.

Online voting is not a new idea. The Computer Society of India and many public limited companies are already using the internet to allow members and shareholders to vote by logging into websites. The challenge is to make sure that only those who are authorised to vote are allowed to login to the site and this is ensured by sending unique userids and passwords by email. Obviously this assumes that every member of the electorate has a valid and validated email ID. But this would not be true when we consider the kind of people who are voting in panchayat, municipal,  state legislature and Lok Sabha elections.

image "borrowed" from http://nohandcuffs.com

This difficulty can be overcome with small, inexpensive hardware devices  that are used for secure logins in some banks and multinational companies but the cost and difficulty of distributing such devices is very high. Finally we need to secure such web based systems from  sophisticated hackers, cyber criminals and from cyber attacks from hostile countries. Since this is a big challenge, we need to consider alternatives.

Consider the ATM network that already spans the entire country.

Having evolved over the years, ATMs are viewed as stable and mature platforms for critical services. Moreover each usage of an ATM card is clearly and unequivocally tagged to a bank account that in turn is connected to a person identified by a rigorous Know-Your-Customer (KYC) process. So while I can always give my ATM card to my wife and tell her the PIN code, I cannot deny or repudiate the actions that are performed with the card. The bank holds me responsible for any money withdrawn or transferred except when -- and this is rare -- the card has been stolen, along with the PIN, and used. The first ATM in India was set up by HSBC in 1987 and after a quarter of a century of usage, the technology has instilled a sense of comfort, both in banks and in people, that it is indeed reliable. Moreover all ATMs can be used to access funds lying in any bank in India.

What if the software in all ATM machines were to be upgraded to include an additional feature to allow voting? Just as the LPG Consumer number, given by the oil companies, is being connected to a bank account, with or without the existence of the Aadhaar card, so can a Voter Card number be connected to a bank account and become verifiable through an associated ATM card. This means that if one were to slide an ATM card into an ATM machine it will uniquely identify the individual on the Electoral Roll. Now it is a simple matter for the system to determine if one is eligible to vote in any particular election that is being conducted by the Election Commission at any point of time. For example, if on a particular date, a by-poll is being conducted in a particular constituency, only those who are registered voters in that constituency would be allowed to press a button and cast their votes -- and obviously, only once. For everyone else, the button to vote will be de-activated. If multiple elections are being conducted on the same date, a menu can be shown to the user so that he can choose the one specific election where he can cast his vote.

And as a by-product, the voter need not be physically present in the geographic area where the election is being held. Votes can be cast by people living or working far away from their home constituencies as long as they can reach an ATM machine.
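The eligibility logic described above can be sketched as follows. This is a toy model, not a design for a real voting system: the class `ElectionRegistry`, the card numbers and the constituency names are all hypothetical, and the sketch records only that a voter has voted, not the choice made.

```python
class ElectionRegistry:
    """Toy model of the Election Commission lookup behind the ATM screen."""

    def __init__(self, card_to_voter, voter_to_constituency):
        self.card_to_voter = card_to_voter
        self.voter_to_constituency = voter_to_constituency
        self._voted = set()  # (voter_id, election_id) pairs already used

    def eligible_elections(self, card_number, open_elections):
        """Elections for which the vote button should be active for this card.

        `open_elections` maps election_id -> constituency currently polling.
        """
        voter = self.card_to_voter.get(card_number)
        if voter is None:
            return []  # card not linked to any voter: button stays de-activated
        home = self.voter_to_constituency[voter]
        return [
            election_id
            for election_id, constituency in open_elections.items()
            if constituency == home and (voter, election_id) not in self._voted
        ]

    def cast_vote(self, card_number, election_id, open_elections):
        """Record that the voter has voted; returns False if not allowed."""
        if election_id not in self.eligible_elections(card_number, open_elections):
            return False
        self._voted.add((self.card_to_voter[card_number], election_id))
        return True


if __name__ == "__main__":
    registry = ElectionRegistry({"CARD-1": "V1"}, {"V1": "KOLKATA-SOUTH"})
    elections = {"BYPOLL-1": "KOLKATA-SOUTH", "BYPOLL-2": "PATNA-SAHIB"}
    print(registry.eligible_elections("CARD-1", elections))
    print(registry.cast_vote("CARD-1", "BYPOLL-1", elections))
    print(registry.cast_vote("CARD-1", "BYPOLL-1", elections))
```

The second attempt to vote in the same by-poll is refused, and a card from another constituency never sees an active button in the first place.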

Now let us do some back-of-the-envelope calculations to check the feasibility of the numbers involved. The total number of voters in India is approximately 82 crores while the number of bank accounts is around 65 crores. That looks nice until we realise that many people have multiple accounts and so the number of distinct account holders could be about 10 crores, leaving a gap of 70 crores. However under the PM’s Jan Dhan Yojana for financial inclusion, almost 11 crore new bank accounts have been opened in the last 6 months. If we continue at this pace then it is a matter of 4 or 5 years before we can provide every voter with a bank account. In fact, in the more densely populated and urban areas, full coverage of almost all voters with bank accounts can occur much earlier.
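The back-of-the-envelope numbers can be laid out explicitly. The sketch uses only the figures quoted above and assumes, purely for illustration, that the Jan Dhan pace stays constant and that every new account goes to a voter who does not already have one.

```python
voters = 82                      # crores of voters
distinct_account_holders = 10    # crores, the article's estimate
new_accounts_per_half_year = 11  # crores opened in the last 6 months

gap = voters - distinct_account_holders         # voters without an account
accounts_per_year = 2 * new_accounts_per_half_year
years_to_close_gap = gap / accounts_per_year    # straight-line estimate

if __name__ == "__main__":
    print(f"Gap: {gap} crores")
    print(f"Years at constant pace: {years_to_close_gap:.1f}")
```

The straight-line estimate comes out at a little over three years, so the four to five years quoted in the text leaves room for the pace slowing down or for new accounts being opened by people who already have one.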

So the proposal to use bank account linked ATM cards to validate voters and allow them to cast votes at ATM machines could actually be an extension of the Government of India’s publicly stated goal of total financial inclusion for the entire population. Universal franchise and universal banking could in fact be made two sides of the same coin of technological development!

But wait, there is more to come! Last year, the President of India inaugurated RuPay, an Indian domestic card scheme launched by the National Payments Corporation of India (NPCI). It was created to fulfill the Reserve Bank of India’s desire to have a domestic electronic payments system in India. RuPay facilitates electronic payment at all Indian banks and financial institutions, and is comparable with MasterCard and Visa in India. Banks in India are authorized to issue RuPay debit cards to their customers for use at ATMs, PoS terminals, and e-commerce websites. Many banks, including all major public sector banks, currently issue RuPay cards to their customers and RuPay cards are also issued at about 200 cooperative and rural banks to promote financial inclusion.

According to data published by the NPCI, there are almost 1.45 lakh ATMs in India that accept the RuPay card. This number is comparable to the 8.35 lakh polling booths that were used during the 2014 Lok Sabha polls and so can be used to reduce the load on the traditional EVM based booths significantly. This will also lead to a surge in the issuance and usage of RuPay cards and help it break into the market dominated by the global giants.

Let us see how the process works.

First, the voter card numbers of all eligible voters need to be replaced with a uniform 16 digit number that reflects the state and the constituency that the voter is eligible to vote in. Next, every voter would need to have a KYC-compliant bank account that will be linked to this voter card number. Any attempt to link the same voter card number to multiple bank accounts will be caught automatically. So the RuPay debit card, or any other ATM card, that is linked to the account becomes the de-facto voting card for those choosing to use the electronic voting option. However all those who still want to cast votes at EVM booths can continue to use the traditional voter card. As a one-time effort, people must decide which option they will exercise.

Fortunately most of the physical components of the ATM network would remain the same and only the software would need to be changed. Today, when someone uses an ATM card, the validity of the card, the PIN and the balance available in the bank account are checked against the user’s own bank computers. In this case, a similar validation will be done against the Election Commission’s computers to determine the voter number of the person and the constituency that he is eligible to vote in.

Finally, when an election is being held in any constituency, the software will allow only those who are eligible to vote in that constituency to cast their votes. So instead of having to go over to that one, single designated booth where he is registered, the voter can simply walk up to the nearest ATM machine and vote in a safe, secure and convenient manner. Not only will this be convenient for the voter, it will also reduce the pressure on the election machinery as the number of polling booths required will be significantly reduced. Moreover, with votes being cast in a distributed manner the threat of booth capture, physical intimidation of voters and the casting of false votes can be significantly reduced.
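A sketch of how the linking step might catch duplicates is given below. The article does not specify how the 16 digits are laid out, so the split used here (2 digits for the state, 3 for the constituency, 11 for a serial number) is purely an assumption, as are the names `parse_voter_number` and `LinkRegistry`.

```python
def parse_voter_number(number):
    """Split a 16 digit voter number into its assumed components."""
    if len(number) != 16 or not (number.isascii() and number.isdigit()):
        raise ValueError("voter number must be exactly 16 digits")
    return {
        "state": number[:2],
        "constituency": number[2:5],
        "serial": number[5:],
    }


class LinkRegistry:
    """Keeps at most one bank account per voter number."""

    def __init__(self):
        self._account_for = {}

    def link(self, voter_number, account):
        """Link a voter number to an account; refuse a second, different account."""
        parse_voter_number(voter_number)  # raises ValueError if malformed
        existing = self._account_for.get(voter_number)
        if existing is not None and existing != account:
            return False  # same voter number offered against another account
        self._account_for[voter_number] = account
        return True


if __name__ == "__main__":
    registry = LinkRegistry()
    print(parse_voter_number("1904200000012345"))
    print(registry.link("1904200000012345", "ACC-001"))
    print(registry.link("1904200000012345", "ACC-002"))
```

Because the state and constituency are encoded in the number itself, the ATM software could decide eligibility for a given election without a separate lookup of the voter's address.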

The scheme has many advantages. There is no major investment in physical infrastructure. Using bank accounts and RuPay cards will accelerate financial inclusiveness and popularize the card while the Election Commission will need less money and manpower. Finally, voters will be able to cast their votes in a secure and convenient manner from anywhere in the country.

Who could ask for anything more?

This article appeared in the March issue of Swarajya

February 22, 2015

Painted Storks at Rabindra Sarovar

This year, all of a sudden and perhaps for the first time since I have been visiting the Rabindra Sarovar in south Calcutta, we have had a small flock of painted storks staying over in the little island in the middle of the lake.

Initially, I thought that they had come from Siberia or some other very cold place but it seems that is not quite true. They are native to India.

this map shows the island where the pictures were taken

February 17, 2015

Satyam ? Shivam ? or Sundaram ? of Shivaratri

Folklore celebrates Shivaratri as the marriage of Lord Shiva with Parvati, the daughter of the Himalayan mountains. The consummation of this marriage is viewed in learned circles as the Yoga, or union, of Shiva with his Shakti. Of the Purusha with his Prakriti. Shiva is the Potential energy of the universe and Shakti is its manifestation in Kinetic form. From Sankara to Vivekananda, adepts have likened Shiva to the Ocean and Shakti to the waves that rise from it and settle back.

But then again there is another perspective!

Shiva is not a God, not even a god, in the traditional dualistic perspective of a devotee and the object of devotion. Shiva is a quality, the quality of "good", as understood in the phrase, "Satyam, Shivam, Sundaram" -- the triad of the True, the Good and the Beautiful.

Truth is viewed as a function of rational analysis, as in the search for the truth. This is the way of the West -- of Egypt, Greece, the Renaissance and modernity. This has led us to major advances in science, technology, engineering and mathematics and gives us the comfortable lifestyle associated with modern societies. Unfortunately, Gödel spikes this model with his Incompleteness Theorem, and tells us that there could be stuff that is true but not provable. Rationality is bounded by its own inherent limitations.

Good is what is perceived by the adepts of the East -- Advaita Vedanta, Zen (Dhyan / meditative) Buddhism and the eroticism of Tantra -- through direct experience, the Samadhi or Ecstasy of Yoga. Logic and reason are passed over in the search for that which is without form, without shape and which cannot be captured or explained in the logic of a language and its grammar! This is explained further in my book, the Road to pSingularity.

Beauty lies at the intersection of what is True and what is Good! In fact, when the True and the Good converge into each other, what is left is the sheer beauty of the experience. This is the beauty of Shivaratri that I share here in the pictures of NP Razeshwarr.

You can see many more such pictures of NP Razeshwarr or know more about him and his art by following these links.

February 12, 2015

Andamans : From Kalapani to Neelapani

We all know about the Andamans from our history books but rarely do we get the opportunity to experience the geography of this very picturesque part of India. Here is a short, photographic record of our visit to this little visited part of India.

We arrived in Port Blair and walked into the pages of the history book that tells us about the Cellular Jail where so many freedom fighters (or "terrorists" from the British point of view) were incarcerated or even martyred.

the outside of the jail

inside the jail, during the Sound and Light Show

inside one of the cells

the long corridor along one of the wings

view from the roof

from the watch tower

these were the actual gallows where prisoners were executed and the picture below shows the oil mill that prisoners had to operate to produce their quota of coconut oil

two of the jail wings from the watch tower

this is the room where Veer Savarkar was incarcerated

The cellular jail was referred to as Kalapani, the place beyond the Black Waters of the unknown sea. But today, we see this sea, not as black but as a brilliant shade of blue, the Neela Pani!

This is Elephant Beach on Havelock Island

the sea at Elephant Beach is really so blue

and if you go snorkeling, as we did, you can see coloured fishes swimming in the crystal clear waters

However the most exciting activity at Elephant Beach was the undersea walk

where we literally walked, and danced, among the corals and the fishes and had professionals help us and take pictures like this

The hotel at Havelock Island where we stayed was awesome. It was right on the waterfront itself

as everywhere in the Andamans, the sea was an amazing colour of blue

The Makruzz catamaran was our means of getting around the various islands of the Andamans. That is how we departed from Havelock and ...

at the pier at Havelock, waiting for the Makruzz to arrive

and arrived at Neil ( or perhaps Neel?) Island

At Neil Island we had the pleasure of watching corals and coloured fishes from a glass bottom boat

and ride JetSkis

and also visited Radhanagar Beach, one of the top ten beaches of the world

Our last port of call was Ross Island, the administrative, social and cultural heart of the British Administration in the Andamans, which was occupied, along with the rest of the islands, by the Japanese during WWII and is now a naval base for the Indian Navy. There are many remnants of the Japanese occupation here.

these are ruined buildings from the Japanese era

and these are Japanese gun emplacements

exploring Ross Island on foot was almost like living out an adventure from the Famous Five books of Enid Blyton. Islands, forests, deer, abandoned bunkers and what not!

this is the far side of Ross Island, the part normally not visited by tourists

we also saw some Japanese Bunkers at Corvyn Beach

Now that we have seen all the pictures let us say a few words about the Andaman Islands. Though these are a part of India they are physically closer to Myanmar and Malaysia. In fact, the islands are a part of the Arakan Range of mountains that runs through Myanmar and represent the peaks of a mountain range that is otherwise submerged in the waters of the Bay of Bengal. Hence walking through Port Blair will remind you of walking through Darjeeling or Simla -- the roads rise and fall like any hill station road -- but of course the temperature is quite warm since it is very close to the mean sea level. So you can call it a sea level hill station!

Most of the people on the Andamans are Bengali speaking settlers from erstwhile East Bengal and Bangladesh plus Hindi speaking immigrants from the heartland of India. The original people of Andamans -- the so called aboriginals, of negroid stock -- are almost extinct, wiped out after coming in contact with modern society. However small pockets of the original natives survive on reservations where we, modern Indians, are not allowed to go without specific permission from government authorities.

