The Nestoria GeoChallenge

The GeoChallenge Savio Dimatteo Perl Developer Lokku Ltd. #geomob London - 11 July 2013

TRANSCRIPT

Page 1: The Nestoria GeoChallenge

The GeoChallenge

Savio Dimatteo
Perl Developer
Lokku Ltd.

#geomob London - 11 July 2013

Page 2: The Nestoria GeoChallenge

We map millions of properties

Page 3: The Nestoria GeoChallenge

Across multiple countries

Page 4: The Nestoria GeoChallenge

What we ask for

<listing>
  ...
  <placenames>
    <building number="35">Napier Court</building>
    <street>Cropley Street</street>
    <area>Hackney</area>
    <town>London</town>
    <region>Greater London</region>
    <province>South East England</province>
  </placenames>
  <location>
    <latitude>51.532855</latitude>
    <longitude>-0.0902938</longitude>
    <postcode>N1 7HS</postcode>
  </location>
  ...
</listing>
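
As a rough illustration of what a well-formed listing makes possible, the sketch below reads the fields shown on this slide using Python's standard `xml.etree.ElementTree`. The function name `read_listing` and the returned dictionary layout are assumptions made for this example; they are not part of the Nestoria feed format or of Lokku's code.

```python
import xml.etree.ElementTree as ET

# The "ideal" listing from the slide, with the elided parts ("...") left out.
LISTING_XML = """
<listing>
  <placenames>
    <building number="35">Napier Court</building>
    <street>Cropley Street</street>
    <area>Hackney</area>
    <town>London</town>
    <region>Greater London</region>
    <province>South East England</province>
  </placenames>
  <location>
    <latitude>51.532855</latitude>
    <longitude>-0.0902938</longitude>
    <postcode>N1 7HS</postcode>
  </location>
</listing>
"""


def read_listing(xml_text):
    """Hypothetical helper: pull place names and coordinates out of a listing."""
    root = ET.fromstring(xml_text)
    # Map each placename tag (street, area, town, ...) to its text.
    placenames = {
        child.tag: (child.text or "").strip()
        for child in root.find("placenames")
    }
    location = root.find("location")
    return {
        "placenames": placenames,
        "latitude": float(location.findtext("latitude")),
        "longitude": float(location.findtext("longitude")),
        "postcode": location.findtext("postcode"),
    }


listing = read_listing(LISTING_XML)
```

This only works because every field is present and well-formed, which is exactly what the following slides show is rarely the case.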

Page 5: The Nestoria GeoChallenge

What we get

<listing>
  ...
  <placenames>
    <neighbourhood>Iporanga</neighbourhood>
    <region>SP</region>
    <town>Sorocaba</town>
  </placenames>
  ...
</listing>

Page 6: The Nestoria GeoChallenge

What we get...

<location>
  <coordinate_system>WGS84</coordinate_system>
  <latitude>0.000000</latitude>
  <longitude>0.000000</longitude>
  <placenames>
    <area>Mansarovar</area>
    <street>Parsavanath at Sanganer, near mansarover muhana mandi</street>
    <town>Jaipur</town>
  </placenames>
</location>

...

Page 7: The Nestoria GeoChallenge

What we get...

<location>
  <full_address>, Porcia, Pordenone</full_address>
  <postcode>33080</postcode>
  <country>IT</country>
  <coordinate_system>WGS84</coordinate_system>
  <latitude></latitude>
  <longitude></longitude>
  <coordinate_confidence unit="m">50</coordinate_confidence>
</location>
...

Page 8: The Nestoria GeoChallenge

What we get...

<location>
  <full_address>, Porcia, Pordenone</full_address>
  <postcode>33080</postcode>
  <country>IT</country>
  <coordinate_system>WGS84</coordinate_system>
  <latitude></latitude>
  <longitude></longitude>
  <coordinate_confidence unit="m">50</coordinate_confidence>
</location>
...

Andreis, Bannia, Barcis, Castions, Cimolais, Cimpello, Claut, Erto e Casso, Fiume Veneto, Frisanco, Ghirano, Orcenico Inferiore, Palse, Poffabro, Porcia, Prata di Pordenone, Puia, Roraipiccolo

Page 9: The Nestoria GeoChallenge

What we get...

<location>
  <country>IN</country>
  <full_address>M.G.RdNr Rly Stn -East Borivali (East) Mumbai Andheri-Dahisar</full_address>
  <coordinate_system>WGS84</coordinate_system>
  <latitude>19.23351</latitude>
  <longitude>72.83678</longitude>
  <placenames>
    <province>Maharashtra</province>
    <region>Mumbai Andheri-Dahisar</region>
    <town>Mumbai Andheri-Dahisar</town>
    <area>Borivali (East)</area>
    <neighbourhood>M.G.RdNr Rly Stn -East</neighbourhood>
  </placenames>
</location>

...

Page 15: The Nestoria GeoChallenge

is there any hope?

Page 16: The Nestoria GeoChallenge

More data

<full_description><![CDATA[ Property Code: RS434487<BR> ]]>

</full_description>

Page 17: The Nestoria GeoChallenge

More (useful) data

<full_description>Box situé dans la cour d'un immeuble à VITRY SUR SEINE, rue Marie Sorin Defresne. Libre de suite Loyer trimestriel: 230 € Contacter Madame GOYARD ...</full_description>

Page 18: The Nestoria GeoChallenge

More (useful) data<full_description>CASA EM CONDOMÍNIO A Venda 5 Dormitórios - Iporanga, Sorocaba - SP. Sobrado em Condomínio com 2 salas, divididas em vários ambientes, cozinha planejada, despensa, lavabo, banheiro social, 5 dormitórios sendo 1 suíte com closet e 1 suíte master, sacada, escritório, área de serviço, piscina, espaço gourmet, quintal e garagem coberta para 4 carros. Acabamento Interno: Salas e escritório com piso porcelanato. Cozinha, banheiro social e suítes com piso cerâmico. Dormitórios com piso laminado de madeira. Salas com teto rebaixado e sanca em gesso, paredes com detalhes em textura. Cozinha planejada com coifa, fogão cooktop, grande mesa central em granito preto, revestimento nas paredes com detalhes em pastilhas, enorme pia com gabinete em granito preto. Banheiro social com revestimento nas paredes, pia em mármore travertino, box em vidro temperado. Escritório amplo com armários, parede com detalhes em textura. Acabamento Externo: Área de serviço com revestimento nas paredes com detalhes em pastilhas. Quintal em piso cerâmico, paredes com detalhes em textura, enorme pia e balcão em granito. Piscina com cascata e piso em torno em pedra mineira. Churrasqueira com paredes e fundo em vidro temperado. Garagem com acesso via portão eletrônico, paredes com detalhes em textura. Jardim em toda sua parte frontal, porta de acesso principal em madeira maciça e fechadura eletrônica, pedra miracema na calçada. Piso cerâmico na entrada do imóvel. Acessórios do Imóvel: Espaço gourmet, piscina, interfone, aquecedor elétrico solar, portão eletrônico e tv a cabo. Acessórios do Condomínio: Lazer: Lago, Pista de Caminhada, Quadra Poliesportiva, Quadra de Tênis, Pista de Skate, Vestiários, Espaço Gourmet com Salão de Festas e Churrasqueira, Sala de ginástica, Salão de jogos, Playground. Segurança: Totalmente murado, portarias com monitoramento de acesso 24hs, cancelas automáticas, cerca elétrica perimetral e ronda 24hs. 
Infraestrutura da Região: Localizado próximo a Zona Industrial de Sorocaba, possui acesso fácil tanto para a Rod. Castelo Branco, quanto para o Centro de Sorocaba.</full_description>

Page 19: The Nestoria GeoChallenge

we can't trust data
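
The earlier slides show three kinds of coordinates in incoming feeds: zeros (the Jaipur listing), empty tags (the Porcia listing), and real-looking values (the Mumbai listing). A minimal sketch of a sanity check for these cases follows. The function name `usable_coordinates` and the rule of treating (0, 0) as missing are assumptions made for illustration, not a description of how Nestoria filters its data.

```python
import xml.etree.ElementTree as ET


def usable_coordinates(location_xml):
    """Return (lat, lon) if the <location> carries plausible coordinates, else None.

    Hypothetical helper: rejects missing, empty, non-numeric, and out-of-range
    values, and treats the (0, 0) placeholder as missing.
    """
    location = ET.fromstring(location_xml)
    try:
        # findtext() gives None for a missing tag and "" for an empty one;
        # both end up as float(""), which raises ValueError.
        lat = float(location.findtext("latitude") or "")
        lon = float(location.findtext("longitude") or "")
    except ValueError:
        return None
    if not (-90 <= lat <= 90 and -180 <= lon <= 180):
        return None
    if lat == 0.0 and lon == 0.0:
        return None
    return lat, lon


# Reduced versions of the samples from the slides (only the coordinate tags).
JAIPUR = "<location><latitude>0.000000</latitude><longitude>0.000000</longitude></location>"
PORCIA = "<location><latitude></latitude><longitude></longitude></location>"
MUMBAI = "<location><latitude>19.23351</latitude><longitude>72.83678</longitude></location>"

results = [usable_coordinates(x) for x in (JAIPUR, PORCIA, MUMBAI)]
```

Passing this check only means the numbers are plausible; it says nothing about whether they point at the right property, which is why the free-text fields still matter.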

Page 20: The Nestoria GeoChallenge

The GeoChallenge

can we still geocode?

Page 21: The Nestoria GeoChallenge

Veocoder

Page 22: The Nestoria GeoChallenge

any text

Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged. It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum.

Veocoder

Page 23: The Nestoria GeoChallenge

Veocoder

any text

Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged. It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum.

places

area

area

area

Page 24: The Nestoria GeoChallenge

Veocoder

any text

Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged. It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum.

places hierarchies

area

area

area

best place
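
The slides describe the flow only at a high level: any text goes in, places are found, place hierarchies are built, and a best place comes out. The sketch below is one way such a flow could look, not Veocoder's actual algorithm. The toy `GAZETTEER`, the function names, and the scoring rule (prefer the place whose ancestors are also mentioned in the text) are all assumptions made for this example.

```python
import re

# Hypothetical toy gazetteer: place name -> parent place (None for the top level).
# A real gazetteer would allow several places to share one name.
GAZETTEER = {
    "sp": None,
    "sorocaba": "sp",
    "iporanga": "sorocaba",
    "campinas": "sp",
    "centro": "campinas",
}


def find_places(text):
    """Step 1: any text -> set of known place names mentioned in it."""
    words = set(re.findall(r"\w+", text.lower()))
    return {name for name in GAZETTEER if name in words}


def hierarchy(place):
    """Step 2: place -> chain from the place up to its top-level ancestor."""
    chain = []
    while place is not None:
        chain.append(place)
        place = GAZETTEER[place]
    return chain


def best_place(text):
    """Step 3: pick the mentioned place whose hierarchy is best supported by the text."""
    mentioned = find_places(text)
    if not mentioned:
        return None

    def score(place):
        chain = hierarchy(place)
        support = sum(1 for p in chain if p in mentioned)
        # More supporting mentions wins; deeper (more specific) places break ties.
        return (support, len(chain))

    # sorted() makes the result deterministic when scores are equal.
    return max(sorted(mentioned), key=score)


DESCRIPTION = (
    "CASA EM CONDOMÍNIO A Venda 5 Dormitórios - Iporanga, Sorocaba - SP. "
    "Localizado próximo a Zona Industrial de Sorocaba, possui acesso fácil "
    "tanto para a Rod. Castelo Branco, quanto para o Centro de Sorocaba."
)

result = best_place(DESCRIPTION)
```

In this toy run the word "Centro" also matches a gazetteer entry, but its parent is never mentioned in the text, so the better-supported chain through Sorocaba and SP wins.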

Page 26: The Nestoria GeoChallenge

But other geocoders?

● they expect "good" input

● price is crazy

● they rate limit

Page 27: The Nestoria GeoChallenge

Geocoding Task

Where is:

Lotus County, Site: Sy No. 122/2, Solapur Road, Opp.To Gate No-6 Mirc Darewadi, Ahmednagar., Opp.To Gate No-6m, Ahmednagar

Page 28: The Nestoria GeoChallenge

Geocoding Task (using http://www.gisgraphy.com/compare)

Page 29: The Nestoria GeoChallenge

Geocoding Task - Veocoder

Page 30: The Nestoria GeoChallenge

Geocoding Task - Veocoder

Page 31: The Nestoria GeoChallenge

think you could do better?

Page 32: The Nestoria GeoChallenge

come and work with us!

lokku.com/jobs

Page 33: The Nestoria GeoChallenge

Thank You

http://www.nestoria.com find property
http://geomobldn.org stay tuned!
http://www.github.com/lokku open source code
http://www.lokku.com/jobs currently hiring!

Page 34: The Nestoria GeoChallenge

Inside the hierarchy generator