Tim Berners-Lee goes on the warpath: "One small step for the Internet "

Tim Berners-Lee goes on the warpath: "One small step for the Internet "
 
 
I have always believed that the Internet is for everyone. That's why I and everyone else are fighting fiercely to protect him. The changes we have achieved have created a better and more connected world. But besides all the good that we have achieved, the network has become the engine of injustice and separation; influenced by powerful forces using it for their own purposes.
 
 
Today I believe that we have reached a critical turning point, and this fundamental change for the better is possible and necessary.
 
 
That is why I have worked with several people at MIT and other places in recent years to ...
+ 0 -

Identification of content profiles in VK

Bots to distinguish from people and the truth is complicated. I myself can not really do it myself. But I came up with a good bicycles
method, how to distinguish in VK "interesting people" from "not very interesting". In terms of network communication, of course, not in life.
 
Identification of content profiles in VK

 
VK put a restriction on the ability to download the contents of the walls of users , and slowly it hurts. Those. It is possible, but it is necessary to greatly refine, optimize and dodge to circumvent the restrictions.
 
 

The basic idea is


 
The main idea is that bots, dull (in the network plan) personality...[/h]
+ 0 -

Game to improve the quality of Wikipedia

Today, a beta version of the online WikiBest game was announced, which is part of the research on data quality in Wikipedia. It is noteworthy that at present the game allows you to compare the quality of data in 5 language versions of Wikipedia: Russian, Ukrainian, Belarusian, Polish, English. In the near future it is planned to expand the number of languages.
 
 
Game to improve the quality of Wikipedia
 
automatic quality assessment articles in this free encyclopedia. However, a large number of problems still remain to be solved. For example, how to automatically evaluate or compare the quality of individual facts in different language versions ...
+ +1 -

Collection of demographic stories in one map

Collection of demographic stories in one map
 
In the recent issue of the magazine
The Lancet
published my article is a curious map and a little explanation for it. I decided to tell about this on Habr, because there is a hope that the implemented way of visualizing the data can be useful to someone else.
 
Kashnitsky , I., & Schöley , J. (2018). Regional population structures at a glance.
The Lancet
, 392 (10143), 209-210. https://doi.org/???/S0140-6736(18)31194-2
Actually, here is a high-resolution map (clickable).
 
...
+ 0 -

We struggle with mistakes and "crutches" in the Unified State Register of Legal Entities - state register of legal entities

We struggle with mistakes and "crutches" in the Unified State Register of Legal Entities - state register of legal entities  
 
Last week we released article about the device USRLE - the state register with data of 10 million companies. That stuff talks about basic things, so it's better to start with it.
 
 
Here we will reveal a rich and fertile topic - the problems of the USRLE, which do not let our developers get bored.
 
Single client ". He puts the data in order: cleans addresses, finds duplicates, corrects typos.
 
 
If you like parsing complex reference books, structuring data and bringing them to a human kind, come to work with us. Now we are looking for a javista for the product "Factor". Salary - from ...
+ 0 -

How is the USRLE - the unified state register of legal entities

How is the USRLE - the unified state register of legal entities  
 
The USRLE is a state register of legal entities in which 10 million Russian companies are kept. Manages the FTS directory.
 
 
From the USRLE we take the data of organizations for " Tips "," Single Customer "And" Factor ". In the article we will tell you how we lived before the directory, how we get access to it and how we work with it.
 
multistat.ru - this is a legal reseller who sold the data of the Federal Tax Service. The problem is that Multistat gave its base with a high price without updates.
 
 
Therefore, we maintained the relevance ...
+ 0 -

Finding the number of commissions "drawing" the whole value of turnout at the presidential election of the Russian Federation in 2018

Finding the number of commissions "drawing" the whole value of turnout at the presidential election of the Russian Federation in 2018Graphs with unusual peaks we now see after every federal election. For the first time they went to the masses after the elections in 201? when people saw falsifications, and got acquainted with the analysis of election data and the problem of integer division in particular.
 
 
Distributions even began to appear their names. This and "Churov's beard" for the 2011 elections, and "Volodin Peak" for the famous 62.2% in Saratov. Since until now even on the habr there are articles, not familiar with the solution of the problem of integer division and do not agree " ...
+ 0 -

Parsing 0.5Tb xml for a few hours. Search of organizations by criteria in the register of subjects of SMEs of the Federal Tax Service

Parsing 0.5Tb xml for a few hours. Search of organizations by criteria in the register of subjects of SMEs of the Federal Tax ServiceBy the nature of the activity (automation of processes and the development of the architecture of information systems), one often has to deal with the need to write a script and get the result "here and now" for an unexpectedly "arrived" task in a situation where there is no possibility of promptly attracting external developers.
 
 
The review will be devoted to solving one such problem. At some point, there was a need to analyze, based on the open data of the "Single Register of Small and Medium Enterprises" of the Federal Tax Service (RNSS), the dynamics by months of the number of organizations ...
+ 0 -

AI.Hack St. Petersburg

<{full}>
Hello, Habr! In this post I'll tell you about one of the coolest
hackathon
with the DS-track, held recently in St. Petersburg. Under the cut - a general overview, the cases that we decided, and, of course, about how both teams of the AU were able to become winners.
 
 
AI.Hack St. Petersburg
 
hackathon
with the DS-track, held recently in St. Petersburg. Under the cut - a general overview, the cases that we decided, and, of course, about how both teams of the AU were able to become winners.
 
 

 
 

Introduction


 
This is the third post from ...[/h]
+ 0 -

Council on Open dаta: Openness of Rosreestra and Federal Property Management Agency, results of 2017 and plans for the future

Council on Open Data: Openness of Rosreestra and Federal Property Management Agency, results of 2017 and plans for the future
 
 
At the end of April, the regular meeting of the Council on Open Data was held, the agenda of which was the openness of Rosreestra and Rosimushchestvo, summing up the activities of the Council on open data and plans for the future.
 
site Rosreestra , but to use it (copy data, reproduce, distribute, publish ) without the written permission of Rosreestr is prohibited. According to the specialists' comments, the problem of lack of access to machine-readable data lies not in the cartographic data itself, but in that it takes into account the protection of copyrights to the cartographic substrate (i.e...
+ 0 -