Game to improve the quality of Wikipedia

Today a beta version of the online game WikiBest was announced as part of research on data quality in Wikipedia. At present the game lets you compare data quality across five language versions of Wikipedia: Russian, Ukrainian, Belarusian, Polish, and English. More languages are planned for the near future.
 
 
 
automatic quality assessment of articles in this free encyclopedia. However, many problems remain unsolved. For example, how can the quality of individual facts be evaluated or compared automatically across different language versions ...

Collection of demographic stories in one map

 
A recent issue of The Lancet published my article: a curious map with a short explanation. I decided to write about it on Habr in the hope that this way of visualizing data will be useful to someone else.
 
Kashnitsky, I., & Schöley, J. (2018). Regional population structures at a glance. The Lancet, 392(10143), 209-210. https://doi.org/???/S0140-6736(18)31194-2
Actually, here is a high-resolution map (clickable).
 
...

Fighting errors and "crutches" in the USRLE, the Unified State Register of Legal Entities

 
Last week we published an article on how the USRLE works: the state register holding data on 10 million companies. That piece covers the basics, so it is best to start there.
 
 
Here we dig into a rich topic: the problems of the USRLE that keep our developers from getting bored.
 
"Single Client". It puts data in order: it cleans addresses, finds duplicates, and corrects typos.
 
 
If you enjoy parsing complex reference data, structuring it, and making it human-readable, come work with us. We are currently looking for a Java developer for the "Factor" product. Salary: from ...

How the USRLE, the Unified State Register of Legal Entities, works

 
The USRLE is the state register of legal entities, holding 10 million Russian companies. The register is maintained by the Federal Tax Service (FTS).
 
 
We take organization data from the USRLE for "Tips", "Single Customer", and "Factor". In this article we describe how we got by before the register, how we obtain access to it, and how we work with it.
 
multistat.ru is a legal reseller that sold Federal Tax Service data. The problem is that Multistat supplied its database at a high price and without updates.
 
 
So we kept the data up to date ...

Finding the number of commissions that "drew" integer turnout values in the 2018 Russian presidential election

Graphs with unusual peaks now appear after every federal election. They first reached a mass audience after the elections in 201?, when people saw falsifications and became acquainted with election data analysis, and with the problem of integer division in particular.
 
 
The distributions even acquired names of their own: "Churov's beard" for the 2011 elections and "Volodin's peak" for the famous 62.2% in Saratov. Since even on Habr there are still articles whose authors are unfamiliar with the solution to the integer-division problem and do not agree " ...

Parsing 0.5 TB of XML in a few hours. Searching for organizations by criteria in the Federal Tax Service register of SMEs

In my line of work (process automation and information systems architecture), I often need to write a script and get a result "here and now" for a task that arrives unexpectedly, when there is no way to bring in outside developers quickly.
 
 
This review covers one such task. At some point the need arose to analyze, using the open data of the Federal Tax Service's "Unified Register of Small and Medium Enterprises" (RNSS), the month-by-month dynamics of the number of organizations ...

AI.Hack St. Petersburg

Hello, Habr! In this post I'll tell you about one of the coolest hackathons with a DS track, held recently in St. Petersburg. Under the cut: a general overview, the cases we solved, and, of course, how both AU teams managed to win.
 
 
 
 

 
 

Introduction


 
This is the third post from ...

Council on Open Data: Openness of Rosreestr and the Federal Property Management Agency, results of 2017 and plans for the future

 
 
At the end of April the Council on Open Data held its regular meeting. The agenda covered the openness of Rosreestr and Rosimushchestvo (the Federal Property Management Agency), a summary of the Council's activities, and plans for the future.
 
the Rosreestr site, but using it (copying, reproducing, distributing, or publishing the data) without Rosreestr's written permission is prohibited. According to specialists, the lack of access to machine-readable data stems not from the cartographic data itself but from copyright protection of the cartographic base layer (i.e...

How to make the state open, Part 1: Downloading road accident statistics yourself

If you look hard enough, you can find quite a lot of useful government information of decent quality. Unfortunately, not all of it is available yet: the EGE and education, weather, cartography, data on crimes and road accidents.
 
 
So I lead two lives: in one I help officials open up data that people or organizations ask for, and in the other I write parsers that turn the public databases of particularly "stubborn" government agencies into open data, and I teach others to do the same, in the hope that there will be many such projects, the state will accept the inevitable, and everything will be published in a convenient form.
 
 
This article ...

How we participated in the hackathon from OpenData

Hello everyone. In this article I want to talk about Why So Serious Hack: what brought us there in the first place, how a hackathon in the classical sense differs from a hackathon run as a contest, and what helped us win.
 
 
 
here. The guys from Matmekh (red pandas) took first place, and we (AU-Rocks) took second.
 
 
By the way, about the names: when we compete as an Academic University team, we always choose a team name with the AU prefix, and the red pandas do likewise. I think it's very cool when you come to an event and can already recognize familiar people by their team name, and they ...