Data Engineering Special Report | |
|
|
|
In this special newsletter we bring you up to date on all the new content and news related to Data Engineering on InfoQ. We are also maintaining a portal page for this content on InfoQ at: https://www.infoq.com/ai-ml-data-eng. |
|
|
|
This white paper delves into different cache types, strategies, and topologies. It examines eviction strategies, Java temporary caching using the JCache API, and introduces Hazelcast IMDG caching. Download now. Sponsored content |
| |
|
Top Viewed Content on InfoQ |
|
|
|
At WWDC Apple released Core ML 2: a new version of their machine learning SDK for iOS devices. The new release of Core ML should create an inference time speedup of 30% for apps developed using Core ML 2. An important new feature of the Core ML SDK is Create ML. Developers can create and train custom machine learning models on their mac. | Google recently introduced ML Kit, a machine-learning module fully integrated in its Firebase mobile development platform and available for both iOS and Android. With this new Firebase module, Google simplifies the creation of machine-learning powered applications on mobile phones and solves some of the challenges of implementing computationally intense features on mobile devices. |
|
This white paper introduces the domain of stream processing, covering use cases, the building blocks of a stream processing solution, and key concepts used when building a streaming pipeline such as: definition of the dataflow, keyed aggregation, windowing. Download now. Sponsored content |
| |
|
Google recently announced Flutter Release Preview 1. Flutter is an open-source framework for cross-platform app development for both iOS and Android. Flutter Release Preview 1 includes support for hardware keyboards and barcode scanners, video recording, ML Kit, an update to the Flutter extension for Visual Studio Code, and more. |
|
The latest version of open-source distributed pub-sub messaging framework Apache Pulsar enables companies to move “beyond batch” by acting on data in motion. Streamlio recently announced the availability of Apache Pulsar 2.0 streaming messaging solution. The new version supports Pulsar Functions, Schema Registry and Topic Compaction. |
|
|
At QCon San Francisco 2016, Neha Narkhede presented “ETL is Dead; Long Live Streams”, and discussed the changing landscape of enterprise data processing. |
| |
|
Machine learning & deep-learning brought data analytics to the developer community. The eMag focuses on the current landscape of ML technologies and presents several associated real-world case studies. |
| |
|
In this article, author Siddharth Teotia discusses the Dremio database which is based on Apache Arrow with vectorization capabilities. |
| |
|
This fall, Wallaroo Labs will be releasing a large new feature set to our distributed data stream processing framework, Wallaroo. |
| |
|
Learn how Spring Boot and Hazelcast IMDG contribute to the microservices landscape, enhancing the benefits and alleviating some of the common downsides of implementing microservices. Download now. Sponsored content |
| |
|
|
Holden Karau discusses how to train models, and how to serve them, including basic validation techniques, A/B tests, and the importance of keeping models up-to-date. |
| |
|
Martin Kleppmann explores how to ensure data consistency in distributed systems, especially in systems that don't have an authoritative leader, and peer-to-peer communication. |
| |
|
Sumedh Pathak talks about his team’s journey to create a more modern relational database, distributed systems, scaling Postgres, distributed query planner and the distributed deadlock detection. |
| |
|
|