Java grafieken voor Opendata

Hoi Weerman,

Based on its use and other modern compression needs, such as web font compression, today we are excited to announce that we have developed and open sourced a new algorithm, the Brotli compression algorithm.

Ik gebruikte altijd Winzip, vaak maakte ik backup van mijn source projectjes met (niet open)rar formaat in WinZip. Ontwikkeling van de proprietary Rar format houdt op en wordt ingehaald door open source varianten. Nu heb ik 7zip.exe geïnstalleerd. N.B. Ik denk dat ik 20 jaar geleden de officiële WinZip applicatie een keer heb gekocht. Zou ik nu nooit meer doen.

And lossless too? I’d prefer if they lost the ads, then the compression wouldn’t be needed.

Nadeel van ad blocker is dat bijvoorbeeld Funda hierna kijkt. Ze denken dat je iemand bent die hun site probeert te kopiëren.

Er zijn ook andere technieken die momenteel gebruikt worden om browsers sneller te maken.

What are protocol buffers?
Protocol buffers are Google’s language-neutral, platform-neutral, extensible mechanism for serializing structured data – think XML, but smaller, faster, and simpler. You define how you want your data to be structured once, then you can use special generated source code to easily write and read your structured data to and from a variety of data streams and using a variety of languages.

Cap’n Proto is an insanely fast data interchange format and capability-based RPC system. Think JSON, except binary. Or thinkProtocol Buffers, except faster. In fact, in benchmarks, Cap’n Proto is INFINITY TIMES faster than Protocol Buffers.

Hier is een diagram van Cap’n Proto

Volgend project is een Distributed Search Engine die ik gevonden heb. Ben nog zoekende naar de beste Distributed Search en Index Engine

  1. Distributed Search Engine with Nanomsg and Bond
    Exploring Microsoft’s open source Bond framework by building a distributed search engine. I’m using bond for serialization/deserialization and nanomsg for communication.

The source for this C++14 project is located at:
The nanomsg library is a simple high-performance implementation of several “scalability protocols”. These scalability protocols are light-weight messaging protocols which can be used to solve a number of very common messaging patterns, such as request/reply, publish/subscribe, surveyor/respondent, and so forth. These protocols can run over a variety of transports such as TCP, UNIX sockets, and even WebSocket.

Bond is a cross-platform framework for working with schematized data. It supports cross-language de/serialization and powerful generic mechanisms for efficiently manipulating data. Bond is broadly used at Microsoft in high scale services.

In de volgende post legt Daniel uit hoe zijn aanpak is.

Naast bovenstaande distributed search project heb ik nog twee andere projecten gevonden

  1. An open source, feature rich search engine dataparksearch.

  2. Xapian is an Open Source Search Engine Library, released under the GPL v2+. It’s written in C++.

De hartelijke groet Jan Marco