Software development kit 1.1 has been released


 

The installer is modular now! Programmer tools 'malt-tool' and 'malt_sw' system are downloaded individually. Installation directory may be changed by ‘MALT_HOME’ environment variable. By default, ‘malt-tools’ is installed to ‘/opt/MALT’ directory, ‘malt_sw’ - to ‘$HOME/MALT’. GNU development tools have been upgraded to latest versions, including gcc - up to 7.2.0 version. Manycore processor support has been added to MALTemu emulator. Thus, the performance of the emulator is almost in proportion to the number of processor cores per host system. The support for any number of arbitrary size {scalar|array} arguments has been added to MALTCC for both SIMD and slave functions.

 

Read more ...

Qualcomm Centriq - 48-core server-side ARM

Qualcomm company has released 48-core server processor. The processor is developed by using 10 nm Samsung technology which enables to place on 400 mm2 area not only 48 cores but sufficiently large amount of cache memory. Qualcomm processor doesn’t lose to the latest generation of Intel Xeon processors by its characteristics and even surpasses them by some parameters...

 

Read more ...

Graphcore processors for machine learning



Graphcore company has announced the creation of a new type of processors for machine learning and graph operations. They named the design IPU - Intelligence Processing Unit. According to the information from the company’s site, the device is designed for massive parallel computing using numbers with low-precision (perhaps, half-precision) floating point. It is promised, that the processor will have a much higher density of computing elements than already existing solutions in this field.

 

Read more ...

The first samples of MALT-C 9Mb96G have been received from TSMC foundry


 

Samples of MALT-C 9Mb96G processor have been received from TSMC foundry (Taiwan). MALT-C 9Mb96G is the first chip belonging to MALT-C family which has been manufactured in silicon using 28 nanometer TSMC HPCPlus (high-performance computing) technological process. The processor contains 9 general-purpose RISC cores and 96 specialized processor elements integrated in 3 SIMD clusters, 32 elements each. MALT-C 9Mb96G samples have been successfully passed entry tests on test vectors on “FORMULA” chip tester and estimated characteristics have matched with achieved characteristics under full load. Thus, full power consumption at operating frequency of 800 MHz was 1 W.

 

Read more ...

NEC is releasing a new vector accelerator



NEC company is going to release a vector accelerator after a long break. The device named NEC SX-Aurora TSUBASA will be manufactured using 16 nm FinFET technological process. This will be the first processor in the world equipped by six HBM2 memory modules which have been manufactured using CoWoS (Chip-on-Wafer-on-Substrate) technology.

 

Read more ...

Cisco new chip



Cisco company has presented a new specialized processor with two chips: one of them has 672 processors, 4 threads each + 44 MB SRAM + 520 SERDES with 6.5 Tb/s combined bandwidth, another one has multi-channel DRAM memory with its own controller: 1 billion random accesses per second and 37.5 GB/s during sequential read.

 

Read more ...

International Blockchain Forum



On October 12 the International Blockchain Forum has taken place in Moscow. Significant part of the presentations has been devoted to ICO and other financial and legal aspects of cryptocurrency. Still, there have been some interesting topics for computer engineering specialists to discuss.

 

Read more ...

Prasad Saggurti’s visit to MSU



On September 2017, there has been held a meeting of the MALT team with Prasad Saggurti - the director of development department (memory, logic libraries and memory test design) of Synopsys company.

 

Read more ...

The first-generation 96-core processor project has been sent to MPW manufacturer


 

The first-generation 96-core processor project has been sent to get manufactured using MPW. VLSIs are going to be manufactured at TSMC semiconductor foundry (Taiwan) using 28 nanometer TSMC HPCPlus (high-performance computing) technological process. MALT-C 9Mb96G is the first MALT-C chip which will be manufactured “in silicon”. The processor contains 9 general-purpose RISC cores and 96 specialized processor elements integrated into 3 SIMD clusters, 32 elements each.

 

Read more ...

The design of the second-generation processor has been started


 

The design of the second-generation processor has been started. The first version of processor element, which architecture is based on improved Leopard architecture, has been developed. The testing and measurement of energy consumption on target algorithms, considering time delays, have been completed in CAD Cadence at the frequency of 1 GHz for TSMC HPC+ 28 nm manufacturing process.

 

Read more ...

Google Reveals Technical Specs and Business Rationale for TPU Processor



Although Google’s Tensor Processing Unit (TPU) has been powering the company’s vast empire of deep learning products since 2015, very little was known about the custom-built processor. This week the web giant published a description of the chip and explained why it’s an order of magnitude faster and more energy-efficient than the CPUs and GPUs it replaces.

 

Read more ...

Applied Micro Claims Third-Generation ARM Chip Ready to Take on Intel Xeon


Applied Micro announced it is sampling X-Gene 3, its third-generation ARM SoC for servers. According to a report by The Linley Group, the new platform will provide comparable performance to the latest Intel Xeon processors, but at a significantly lower price point.



X-Gene 3 respectable performance profile is a result of its relatively high clock speed and memory bandwidth. The CPU runs its 32 cores at a base frequency of 3.0 GHz, and can achieve 3.3 GHz under turbo mode. To feed those cores with data, the chip includes eight memory channels, which can serve DDR4 devices at up to 2667 MHz, yielding 170 GB/sec of aggregate bandwidth. The SoC also includes 42 lanes of PCIe 3.0 links for external connectivity.


The report states that the X-Gene 3 can handle a “a broad range of cloud workloads, including scale-up and scale-out applications.” It should be particular adept at so-called big data applications like in-memory database processing, thanks to its superior memory bandwidth. Coincidently (or perhaps not), AMD is touting its upcoming “Naples” x86 chip for very similar memory bandwidth capabilities, based on the same 8-channel per socket design.



Read more...

The design of the first-generation 96-core MALT processor’s front end has been completed


 

The design of the first-generation 96-core processor’s front end has been completed for VLSI manufacturing at TSMC 28nm HPC+ factory (Taiwan). The basis is intended for high-performance integrated circuit development (high-performance computing, HPC). The designed processor belongs to MALT-C family. It contains 9 general-purpose RISC cores and 96 SIMD processor elements. Estimated chip area 12 mm2, power consumption 1,2 W at the frequency 0,8 GHz. Estimated date of sample delivery: January, 2018.

 

Read more ...

Japan kicks off AI supercomputer project

 

 

Sunway TaihuLight supercomputer

Image: SUNWAY TAIHULIGHT SYSTEM REPORT

 

Japan has started a project to build the world's fastest supercomputer by the end of 2017.

 

Read more ...

Intel Expands Its Comfort Zone with New ARM-Powered FPGAs for Datacenters



Intel announced it is sampling its Stratix 10 FPGAs, the latest family of field programmable gate arrays that are designed to accelerate a number of datacenter workloads. The new devices, which Intel is calling “the most significant FPGA innovations in over a decade,” offer advanced features like embedded 64-bit ARM processors, second-generation High Bandwidth Memory (HBM2), and DSP blocks





The server applications Intel is targeting with the Stratix 10 family is somewhat tangential to Nvidia's and AMD's newest GPU accelerators, as well as Intel’s own Knights Landing Xeon Phi.. However Intel believes workloads such as signal processing, data compression, data encryption, storage management, and video encoding – in truth tough, practically any server-side application where data throughput is the driving criteria. With the DSP unit offering lots of hardwired flops, these devices can also be used for high performance computing.


Read more...

The debugging set for MALT has been released


 

The first version of tool kit for MALT software development and debugging has been released. The kit includes emulator, debugger and profiler. The emulator enables to execute and debug MALT programs on general-purpose computers running under Unix-like systems. The emulator, its integrated GDB debugger and the profiler significantly simplify development and porting programs on MALT system and also make it possible to evaluate the efficiency of algorithm implementation on MALT without running it on real hardware.

 

Read more ...

Kilocore - World's First 1,000-Processor Chip

 

Image: The University of California

A microchip containing 1,000 independent programmable processors has been designed by a team at the University of California, Davis, Department of Electrical and Computer Engineering.

 

Read more ...