Konstantinos Nikitopoulos

Dr Konstantinos Nikitopoulos


Reader - Signal Processing for Wireless Communications
+44 (0)1483 683423
03 CII 01

Biography

Areas of specialism

Massively Parallel Algorithms for Communication Systems; Advanced Physical Layer Architectures; Testbeds and Trials; 5G Systems and Beyond

Affiliations and memberships

The Institute of Electrical and Electronics Engineers (IEEE)
Senior IEEE Member
UK’s Higher Education Academy
Fellow Member

Research

Research interests

Research projects

Supervision

Completed postgraduate research projects I have supervised

My teaching

My publications

Highlights

My recent research highlights include SWORD: a new SoftWare Open Radio Design that is flexible, open for research, low-cost, scalable and software-driven, and able to support advanced large and massive Multiple-Input Multiple-Output (MIMO) approaches, MultiSphere: the first method to massively parallelize the detection of large numbers of mutually interfering information streams, and g-MultiSphere: MultiSphere's generalization for application to non-orthogonal signal transmissions like Non-Orthogonal Multiple Access (NOMA) and Spectrally-Efficient Frequency Division Multiplexing (SE-FDM). The last are outcomes of the MultiSphere Project

An outline of my work is also included in “Massively Parallel and Flexible Signal Processing for large MIMO Systems” in the John Wiley & Sons in Wiley 5G REF: The Essential 5G Reference Online.

My scholarly contributions and academic indexes can be found here.

Publications

Nikitopoulos K, Zhang D, Lai I-W, Ascheid G (2009) Complexity-Efficient Enumeration Techniques for Soft-Input, Soft-Output Sphere Decoding,Nikitopoulos, K., Dan Zhang, I-wei Lai;, Ascheid, G. "Complexity-efficient enumeration techniques for soft-input, soft-output sphere decoding," IEEE Communications Letters, vol.14, no.4, pp.312-314, Apr. 2010
In this paper two complexity efficient soft sphere-decoder modifications are proposed for computing the max-log LLR values in iterative MIMO systems, which avoid the costly, typically needed, full enumeration and sorting (FES) procedure during the tree traversal without compromising the max-log performance. It is shown that despite the resulting increase in the number of expanded nodes, they can be more computationally efficient than the typical soft sphere decoders by avoiding the unnecessary complexity of FES.
Nikitopoulos K, Chatzipanagiotis D, Jayawardena C, Tafazolli R (2016) MultiSphere: Massively Parallel Tree Search for Large Sphere Decoders,IEEE GLOBECOM 2016 Proceedings
?This work introduces MultiSphere, a method to massively parallelize the tree search of large sphere decoders in a nearly-independent manner, without compromising their maximum-likelihood performance, and by keeping the overall processing complexity at the levels of highly-optimized sequential sphere decoders. MultiSphere employs a novel sphere decoder tree partitioning which can adjust to the transmission channel with a small latency overhead. It also utilizes a new method to distribute nodes to parallel sphere decoders and a new tree traversal and enumeration strategy which minimize redundant computations despite the nearly-independent parallel processing of the subtrees. For an 8 × 8 MIMO spatially multiplexed system with 16-QAM modulation and 32 processing elements MultiSphere can achieve a latency reduction of more than an order of magnitude, approaching the processing latency of linear detection methods, while its overall complexity can be even smaller than the complexity of well-known sequential sphere decoders. For 8×8 MIMO systems, MultiSphere?s sphere decoder tree partitioning method can achieve the processing latency of other partitioning schemes by using half of the processing elements. In addition, it is shown that for a multi-carrier system with 64 subcarriers, when performing sequential detection across subcarriers and using MultiSphere with 8 processing elements to parallelize detection, a smaller processing latency is achieved than when parallelizing the detection process by using a single processing element per subcarrier (64 in total).
Nikitopoulos K, Polydoros A (2005) Phase-impairment effects and compensation algorithms for OFDM systems,IEEE Transactions on Communications53(4)pp. 698-707
The simultaneous perturbation of an orthogonal frequency-division multiplexing receiver by phase noise plus a residual frequency offset (due to synchronization errors) is modeled here as a combined phase impairment, whose effect is evaluated analytically for the case of a frequency-selective fading channel. A nonpilot-aided (decision-directed) scheme is proposed, which compensates for the common (over all the subcarriers) phase-impairment effect. By representing the resulting intercarrier interference as an uncorrelated, unequal-variance process in the frequency domain, maximum-likelihood (ML) and approximate ML estimators of the complex-vector and phase-only types are derived and analytically evaluated. The present schemes are also compared with other current methods based on individual phase trackers, one per subcarrier. Finally, two suggestions are introduced for increasing the robustness of the algorithms to tentative-decision errors. It is demonstrated through simulations that the analysis is accurate, and that the proposed schemes achieve error-rate performance close to that of ideal compensation. © 2005 IEEE.
Georgis Georgios, Nikitopoulos Konstantinos, Jamieson K (2017) Geosphere: an Exact Depth-First Sphere Decoder Architecture Scalable to Very Dense Constellations,IEEE Access5pp. 4233-4249 IEEE
This paper presents the algorithmic design, experimental evaluation, and VLSI implementation of Geosphere, a depth-first sphere decoder able to provide the exact maximumlikelihood solution in dense (e.g., 64) and very dense (e.g., 256, 1024) QAM constellations by means of a geometrically inspired enumeration. In general, linear detection methods can be highly effective when the MIMO channel is well-conditioned. However, this is not the case when the size of the MIMO system increases and the number of transmit antennas approaches the number of the receive antennas. Via our WARP testbed implementation we gather indoor channel traces in order to evaluate the performance gains of sphere detection against zero-forcing and MMSE in an actual indoor environment. We show that Geosphere can nearly linearly scale performance with the number of user antennas; in 4 × 4 multi-user MIMO for 256-QAM modulation at 30 dB SNR there is a 1.7× gain over MMSE and 2.4× over zeroforcing and a 14% and 22% respective gain in 2 × 2 systems. In addition, by using a new node labeling based enumeration technique, low-complexity integer arithmetic and fine-grained clock gating, we implement for up to 1024-QAM constellations and compare in terms of area, delay, power characteristics, the Geosphere VLSI architecture and the best-known best-scalable exact ML sphere decoder. Results show that Geosphere is twice as area-efficient and 70% more energy efficient in 1024-QAM. Even for 16-QAM Geosphere is 13% more area efficient than the best-known implementation for 16-QAM and it is at least 80% more area efficient than state-of-the-art K-best detectors for 64-QAM.
Nikitopoulos K (2012) Maximum likelihood detection of spatially multiplexed signals via loosely ordered depth-first sphere decoding, Electronics Letters48(21)pp. 1368-1370
The complexity of depth-first sphere decoders (SDs) is determined by the employed tree search and pruning strategies. Proposed is a new SD approach for maximum-likelihood (ML) detection of spatially multiplexed, high-order, QAM symbols. In contrast to typical ML approaches, the proposed tree traversal skips the computationally intensive requirement of visiting the nodes in ascending order of their partial distances (PDs). Then, a new pruning method efficiently narrows the search space and preserves the ML performance despite the non-ordered tree traversal. This proposed approach results in substantially reduced PD calculations when compared to typical ML SDs and, for high SNRs, the necessary calculations can be reduced down to the number of transmit antennas. © 2012 The Institution of Engineering and Technology.
Zhang D, Lai I-W, Nikitopoulos K, Ascheid G (2010) Informed message update for iterative MIMO demapping and turbo decoding., ISITApp. 873-878 IEEE
Nikitopoulos K, Zhou J, Congdon B, Jamieson K (2015) Geosphere: Consistently turning MIMO capacity into throughput, Computer Communication Review44(4)pp. 631-642
This paper presents the design and implementation of Geosphere, a physical- and link-layer design for access point-based MIMO wireless networks that consistently improves network throughput. To send multiple streams of data in a MIMO system, prior designs rely on a technique called zero-forcing, a way of "nulling" the interference between data streams by mathematically inverting the wireless channel matrix. In general, zero-forcing is highly effective, significantly improving throughput. But in certain physical situations, the MIMO channel matrix can become "poorly conditioned," harming performance. With these situations in mind, Geosphere uses sphere decoding, a more computationally demanding technique that can achieve higher throughput in such channels. To overcome the sphere decoder's computational complexity when sending dense wireless constellations at a high rate, Geosphere introduces search and pruning techniques that incorporate novel geometric reasoning about the wireless constellation. These techniques reduce computational complexity of 256-QAM systems by almost one order of magnitude, bringing computational demands in line with current 16- and 64-QAM systems already realized in ASIC. Geosphere thus makes the sphere decoder practical for the first time in a 4 × 4 MIMO, 256-QAM system. Results from our WARP testbed show that Geosphere achieves throughput gains over multi-user MIMO of 2× in 4 × 4 systems and 47% in 2 × 2 MIMO systems.
Liao C-H, Lai I-W, Nikitopoulos K, Borlenghi F, Kammler D, Witte EM, Zhang D, Chiueh T-D, Ascheid G, Meyr H (2009) Combining orthogonalized partial metrics: Efficient enumeration for soft-input sphere decoder., PIMRCpp. 1287-1291 IEEE
Zhang D, Nikitopoulos K, Lai I-W, Ascheid G, Meyr H (2010) Iterative channel estimation control for MIMO-OFDM Systems., CISSpp. 1-6 IEEE
Nikitopoulos K, Barghi S, Jafarkhani H, Yousefi'zadeh H (2012) Multi-user detection for asynchronous space-frequency block coded schemes in frequency selective environments., GLOBECOMpp. 4308-4313 IEEE
Nikitopoulos K, Ascheid G (2010) Complexity Adjusted Soft-Output Sphere Decoding by Adaptive LLR Clipping, CoRRabs/1011.2113
Nikitopoulos K, Ascheid G (2012) A simple complexity adjustment technique for soft MIMO receivers in broadcasting scenarios., ISWCSpp. 166-170 IEEE
Nikitopoulos K, Ascheid G (2011) Approximate MIMO Iterative Processing with Adjustable Complexity Requirements, IEEE Transactions on Vehicular Technology, vol. 61, no. 2, pp. 639-650, Feb. 2012
Targeting always the best achievable bit error rate (BER) performance in iterative receivers operating over multiple-input multiple-output (MIMO) channels may result in significant waste of resources, especially when the achievable BER is orders of magnitude better than the target performance (e.g., under good channel conditions and at high signal-to-noise ratio (SNR)). In contrast to the typical iterative schemes, a practical iterative decoding framework that approximates the soft-information exchange is proposed which allows reduced complexity sphere and channel decoding, adjustable to the transmission conditions and the required bit error rate. With the proposed approximate soft information exchange the performance of the exact soft information can still be reached with significant complexity gains.
Nikitopoulos K, Ascheid G (2012) Approximate MIMO Iterative Processing With Adjustable Complexity Requirements., IEEE T. Vehicular Technology612pp. 639-650
Nikitopoulos K, Karachalios A, Reisis D (2014) Exact Max-Log MAP Soft-Output Sphere Decoding via Approximate Schnorr-Euchner Enumeration,IEEE Transactions on Vehicular Technology
Nikitopoulos K, Zhou J, Congdon B, Jamieson K (2014) Geosphere: Consistently turning MIMO capacity into throughput, SIGCOMM 2014 - Proceedings of the 2014 ACM Conference on Special Interest Group on Data Communicationpp. 631-642
This paper presents the design and implementation of Geosphere, a physical- and link-layer design for access point-based MIMO wireless networks that consistently improves network throughput. To send multiple streams of data in a MIMO system, prior designs rely on a technique called zero-forcing, a way of "nulling" the interference between data streams by mathematically inverting the wireless channel matrix. In general, zero-forcing is highly effective, significantly improving throughput. But in certain physical situations, the MIMO channel matrix can become "poorly conditioned," harming performance. With these situations in mind, Geosphere uses sphere decoding, a more computationally demanding technique that can achieve higher throughput in such channels. To overcome the sphere decoder's computational complexity when sending dense wireless constellations at a high rate, Geosphere introduces search and pruning techniques that incorporate novel geometric reasoning about the wireless constellation. These techniques reduce computational complexity of 256-QAM systems by almost one order of magnitude, bringing computational demands in line with current 16- and 64-QAM systems already realized in ASIC. Geosphere thus makes the sphere decoder practical for the first time in a 4 x 4 MIMO, 256-QAM system. Results from our WARP testbed show that Geosphere achieves throughput gains over multi-user MIMO of 2x in 4 x 4 systems and 47% in 2 x 2 MIMO systems. © 2014 ACM.
Nikitopoulos K, Ascheid G (2010) Complexity Adjusted Soft-Output Sphere Decoding by Adaptive LLR Clipping, IEEE Communications Letters15
A-posteriori probability (APP) receivers operating over multiple-input, multiple-output channels provide enhanced bit error rate (BER) performance at the cost of increased complexity. However, employing full APP processing over favorable transmission environments, where less efficient approaches may already provide the required performance at a reduced complexity, results in unnecessary processing. For slowly varying channel statistics substantial complexity savings can be achieved by simple adaptive schemes. Such schemes track the BER performance and adjust the complexity of the soft output sphere decoder by adaptively setting the related log-likelihood ratio (LLR) clipping value.
Nikitopoulos K, Ascheid G (2010) MIMO APP Receiver Processing with Performance-Determined Complexity,
Typical receiver processing, targeting always the best achievable bit error rate performance, can result in a waste of resources, especially, when the transmission conditions are such that the best performance is orders of magnitude better than the required. In this work, a processing framework is proposed which allows adjusting the processing requirements to the transmission conditions and the required bit error rate. It applies a-posteriori probability receivers operating over multiple-input multiple-output channels. It is demonstrated that significant complexity savings can be achieved both at the soft, sphere-decoder based detector and the channel decoder with only minor modifications.
Nikitopoulos K, Polydoros A (2007) Inter-Frame, Fine Frequency/Phase Synchronization forSimple Space-Time-Coded OFDM Receivers., IEEE Transactions on Wireless Communications610pp. 3510-3514
Nikitopoulos K, Stefanatos S, Katsaggelos AK (2009) Decision-aided compensation of severe phase-impairment-induced inter-carrier interference in frequency-selective OFDM., IEEE Transactions on Wireless Communications84pp. 1614-1619
Lai I-W, Liao C-H, Witte EM, Kammler D, Borlenghi F, Nikitopoulos K, Ramakrishnan V, Zhang D, Chiueh T-D, Ascheid G, Meyr H (2009) Searching in the Delta Lattice: An Efficient MIMO Detection for Iterative Receivers., GLOBECOMpp. 1-6 IEEE
Husmann C, Georgis G, Nikitopoulos K, Jamieson K (2017) FlexCore: Massively Parallel and Flexible Processing for Large MIMO Access Points,Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI ?17)pp. 197-211
Large MIMO base stations remain among wireless network designers? best tools for increasing wireless throughput while serving many clients, but current system designs, sacrifice throughput with simple linear MIMO detection algorithms. Higher-performance detection techniques are known, but remain off the table because these systems parallelize their computation at the level of a whole OFDM subcarrier, sufficing only for the lessdemanding linear detection approaches they opt for. This paper presents FlexCore, the first computational architecture capable of parallelizing the detection of large numbers of mutually-interfering information streams at a granularity below individual OFDM subcarriers, in a nearly-embarrassingly parallel manner while utilizing any number of available processing elements. For 12 clients sending 64-QAM symbols to a 12-antenna base station, our WARP testbed evaluation shows similar network throughput to the state-of-the-art while using an order of magnitude fewer processing elements. For the same scenario, our combined WARP-GPU testbed evaluation demonstrates a 19× computational speedup, with 97% increased energy efficiency when compared with the state of the art. Finally, for the same scenario, an FPGAbased comparison between FlexCore and the state of the art shows that FlexCore can achieve up to 96% better energy efficiency, and can offer up to 32× the processing throughput.
Nikitopoulos K, Mehran F, Jafarkhani H (2017) Space-Time Super-Modulation and its Application to Joint Medium Access and Rateless Transmission,IEEE GLOBECOM 2016 Conference Proceedings IEEE
The paper introduces the concept of Space-Time Super-Modulation according to which additional low rate and highly reliable information can be transmitted by further supermodulating blocks of traditionally modulated and space-time encoded information. This is achieved by exploiting the redundant information introduced by the space-time-block codes and, specifically, by efficiently mapping transmission patterns to specific information content. It is shown that Space-Time SuperModulation can be efficiently used in the context of MachineType-Communications to enable joint medium access and rateless data transmission while minimizing or even eliminating the need for transmitting preamble sequences. Compared with traditional approaches that use encoded preambles or preambles based on Zadoff-Chu sequences to transmit the signature information of transmitted packets, Space-Time Super-Modulation can achieve throughput gains of more than 45% when transmitting blocks of 200 symbols.
Mao J, Wang C, Zhang L, He C, Xiao P, Nikitopoulos K (2018) A DHT-based Multicarrier Modulation System with Pairwise ML Detection,Proceedings of the 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC)pp. 1-6 IEEE
This paper proposes a complex-valued discrete multicarrier modulation (MCM) system based on the real-valued discrete Hartley transform (DHT) and its inverse (IDHT). Unlike conventional discrete Fourier transform (DFT), DHT can not diagonalize the multipath fading channel due to its inherent properties, which results in the mutual interference between subcarriers in the same mirror-symmetrical pair.We explore the interference pattern in order to seek an optimal solution to utilize the channel diversity for the purpose of enhancing system bit error performance (BEP). It is shown that the optimal channel diversity gain can be achieved via a pairwise maximum likelihood (ML) detection, taking into account not only the subcarrier?s own channel quality but also the channel state of its mirror-symmetrical peer. Performance analysis indicates that DHT-based MCM mitigates the fast fading effect by averaging the channel power gain on the mirror-symmetrical subcarriers. Simulation results show that the proposed system has a substantial improvement in BEP over conventional DFT-Based MCM.
Husmann C, Nikolaou P, Nikitopoulos K (2017) Reduced Latency ML Polar Decoding via Multiple Sphere-Decoding Tree Searches,IEEE Transactions on Vehicular Technology67(2)pp. 1835-1839 IEEE
Sphere decoding (SD) has been proposed as an efficient way to perform maximum-likelihood (ML) decoding of Polar codes. Its latency requirements, however, are determined by its ability to promptly exclude from the ML search (i.e., prune) large parts of the corresponding SD tree, without compromising the ML optimality. Traditional depth-first approaches initially find a ?promising" candidate solution and then prune parts of the tree that cannot result to a ?better" solution. Still, if this candidate solution is far (in terms of Euclidean distance) from the ML one, pruning becomes inefficient and decoding latency explodes. To reduce this processing latency, an early termination approach is, first, introduced that exploits the binary nature of the transmitted information. Then, a simple but very efficient SD approach is proposed that performs multiple tree searches that perform decreasingly aggressive pruning. These searches are almost independent and can take place sequentially, in parallel, or even in a hybrid (sequential/parallel) manner. For Polar codes of 128 block size, both realizations can provide a latency reduction of up to four orders of magnitude compared to state-of-the-art Polar sphere decoders. Then, a further 50% latency reduction can be achieved by exploiting the parallel nature of the approach.
Nikitopoulos Konstantinos, Mehran Farhad, Jafarkhani H (2017) Space-Time Super-Modulation: Concept, Design Rules, and its Application to Joint Medium Access and Rateless Transmission,IEEE Transactions on Wireless Communications16(12)pp. 8275-8288 IEEE
We introduce the concept of Space-Time Super-Modulation according to which additional lowrate and highly reliable information can be transmitted on top of traditionally modulated and spacetime encoded information, without increasing the transmitted block length or degrading their error-rate performance. This is achieved by exploiting the temporal redundancy introduced by the space-time block codes and, specifically, by efficiently mapping transmission patterns to specific information content. We show that Space-Time Super-Modulation can be efficiently used in the context of machine-type communications to enable ?one-shot?, ?grant-free" joint medium access and rateless data transmission while reducing or even eliminating the need for transmitting preamble sequences. As a result, compared with traditional approaches that use correlatable preamble sequences or encoded preambles to transmit the signature information of transmitted packets, Space-Time Super-Modulation can achieve significant throughput gains. For example, we show up to 35% throughput gains from the second best examined preamble-based scheme when transmitting blocks of 200 bits.
He Chang, Xiao Pei, Zhang Lei, Mao Juquan, Cao Aijun, Nikitopoulos Konstantinos (2017) Efficient DCT-MCM Detection for Single and Multi-Antenna Wireless Systems,Proceedings of the 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC) IEEE
The discrete cosine transform (DCT) based multicarrier modulation (MCM) system is regarded as one of the promising transmission techniques for future wireless communications. By employing cosine basis as orthogonal functions for multiplexing each real-valued symbol with symbol period of T , it is able to maintain the subcarrier orthogonality while reducing frequency spacing to 1/(2T ) Hz, which is only half of that compared to discrete Fourier transform (DFT) based multicarrier systems. In this paper, following one of the effective transmission models by which zeros are inserted as guard sequence and the DCT operation at the receiver is replaced by DFT of double length, we reformulate and evaluate three classic detection methods by appropriately processing the post- DFT signals both for single antenna and multiple-input multipleoutput (MIMO) DCT-MCM systems. In all cases, we show that with our reformulated detection approaches, DCT-MCM schemes can outperform, in terms of error-rate, conventional OFDMbased systems.
Mehran Farhad, Nikitopoulos Konstantinos (2018) Generalized Space-Time Super-Modulation for Headerless Grant-Free Rateless Multiple Access,Proceedings of the IEEE Global Communications Conference 2018 Institute of Electrical and Electronics Engineers (IEEE)
This work introduces Generalized Space-Time Super-Modulation (GSTSM), a generalization of the recently proposed Space-Time Super-Modulation scheme that enables the transmission of additional, highly-reliable information on the top of conventionally transmitted symbols, without increasing the corresponding packet length. GSTSM jointly exploits the spatial and temporal dimensions of multiple-antenna systems but, in contrast to the initially proposed approach, it does not require the use of space-time block codes. Instead, GSTSM jointly elaborates on the concepts of spatial modulation and spatial diversity, while intentionally introducing temporal correlation to the transmitted symbol sequence. In the context of machine-type communications, GSTSM enables one-shot and grant-free medium access without transmitting additional headers to convey each machine?s ID. As a result, we show that GSTSM can provide throughput gains of up to 2.5 X compared to conventional header-based schemes, even in the case of colliding packets.
Husmann Christopher Camilo Mischa, Nikitopoulos Konstantinos (2018) ViPer MIMO: Increasing Large MIMO Efficiency via Practical Vector-Perturbation,Proceedings of IEEE Globecom2018 WC IEEE
Large multi-user MIMO systems with spatial multiplexing are among the most promising approaches for increasing wireless throughput while serving many clients. Yet, the achievable spectral efficiency of current large MIMO systems is limited by the adoption of simple, but sub-optimal, linear precoding techniques (e.g, minimum-mean-square-error (MMSE)). Nonlinear precoding methods, like Vector Perturbation (VP), claim to be able to provide improved network throughput. However, such methods are still purely theoretical and they do not account for the practical aspects of actual wireless systems, as the corresponding complexity and latency requirements, or the need for practical rate adaptation. This paper presents ViPer, the first practical VP-based MIMO system design. ViPer substantially reduces the latency requirements of VP by employing massively parallel processing and realizes a practical rate adaptation method that efficiently translates VP?s signal-to-noise-ratio (SNR) gains into actual throughput gains. In our first systematic experimental evaluation of VP-based precoders, we show that ViPer can deliver in practice up to 30% higher throughput than MMSE precoding with comparable latency requirements. In addition, ViPer can match the performance of state-of-the-art parallel VP precoding schemes, by utilizing less than one tenth of the processing elements.
Husmann Christopher, Tafazolli Rahim, Nikitopoulos Konstantinos Antipodal Detection and Decoding for Large Multi-User MIMO with Reduced Base-Station Antennas,Proceedings of the IEEE GLOBECOM 2018 Workshops Institute of Electrical and Electronics Engineers (IEEE)
To avoid unnecessarily using a massive number of base station antennas to support a large number of users spatially multiplexed multi-user MIMO systems, optimal detection methods are required to demultiplex the mutually interfering information streams. Sphere decoding (SD) can achieve this, but its complexity and latency becomes impractical for large MIMO systems. Low complexity detection solutions such as linear detectors (e.g., MMSE) or likelihood ascendant search (LAS) approaches, have significantly lower latency requirements than SD but their achievable throughput is far from optimal. This work presents the concept of Antipodal detection and decoding, that can deliver very high throughput with practical latency requirements, even in systems where the number of user antennas reaches the number of base station antennas. The Antipodal detector either results in a highly reliable vector solution, or it does not find a vector solution at all (i.e., it results in an erasure), skipping the heavy processing load related to finding vector solutions that have a very high likelihood to be erroneous. Then, a belief-propagation-based decoder is proposed, that restores these erasures and further corrects remaining erroneous vector solutions. We show that for 32å32, 64-QAM modulated systems, and for packet error rates below 10%, Antipodal detection and decoding requires 9 dB less transmitted power than systems employing soft MMSE or LAS detection and LDPC decoding with similar complexity requirements. For the same scenario, our Antipodal method achieves practical throughput gains of more than 50% compared to soft MMSE and soft LAS-based methods.
Nikitopoulos Konstantinos, Georgis Georgios, Jayawardena Chathura, Chatzipanagiotis Daniil, Tafazolli Rahim (2018) Massively Parallel Tree Search for High-Dimensional Sphere Decoders,Transactions on Parallel and Distributed Systems IEEE
The recent paradigm shift towards the transmission of large numbers of mutually interfering information streams, as in the case of aggressive spatial multiplexing, combined with requirements towards very low processing latency despite the frequency plateauing of traditional processors, initiates a need to revisit the fundamental maximum-likelihood (ML) and, consequently, the sphere-decoding (SD) detection problem. This work presents the design and VLSI architecture of MultiSphere; the first method to massively parallelize the tree search of large sphere decoders in a nearly-concurrent manner, without compromising their maximum-likelihood performance, and by keeping the overall processing complexity comparable to that of highly-optimized sequential sphere decoders. For a 10 å 10 MIMO spatially multiplexed system with 16-QAM modulation and 32 processing elements, our MultiSphere architecture can reduce latency by 29å against well-known sequential SDs, approaching the processing latency of linear detection methods, without compromising ML optimality. In MIMO multicarrier systems targeting exact ML decoding, MultiSphere achieves processing latency and hardware efficiency that are orders of magnitude improved compared to approaches employing one SD per subcarrier. In addition, for 16å16 both ?hard?- and ?soft?-output MIMO systems, approximate MultiSphere versions are shown to achieve similar error rate performance with state-of-the art approximate SDs having akin parallelization properties, by using only one tenth of the processing elements, and to achieve up to approximately 9å increased energy efficiency.
Jayawardena Chathura, Nikitopoulos Konstantinos Massively Parallel Detection for Non-Orthogonal Signal Transmissions,Proceedings of the IEEE GLOBECOM 2018 Workshops Institute of Electrical and Electronics Engineers (IEEE)
The increasing demand for massive connectivity with low latency requirements has triggered a paradigm shift towards Non-Orthogonal transmissions. Still, to translate the theoretical gains of Non-Orthogonal transmissions into practical, efficient ?soft? detection schemes are required. The detection latency and/or complexity of state-of-the-art detection methods becomes impractical for large Non-Orthogonal systems, both due to the large number of interfering streams and due to the rank-deficient or ill-determined nature of the corresponding interference matrix. Extending the recently proposed MultiSphere framework, this work introduces NorthCore; a massively parallel sphere-decoding-based scheme for the detection of large and illdetermined Non-Orthogonal systems. Similarly to MultiSphere, NorthCore reduces the corresponding search space by focusing the available processing power to the most promising vector solutions that are processed in parallel. As a result, the proposed detection scheme can attain a detection processing latency similar to that of highly-suboptimal linear detectors and even outperform state-of-the-art sophisticated detection approaches with up to an order of magnitude reduced complexity. To identify the most promising vector solutions, NorthCore introduces a sortfree candidate selection technique that reduces the necessary preprocessing complexity by up to an order of magnitude, making the proposed approach practical.
He Chang, Cao Aijun, Xiao Lixia, Zhang Lei, Xiao Pei, Nikitopoulos Konstantinos (2019) Enhanced DCT-OFDM System With Index Modulation,IEEE Transactions on Vehicular Technology Institute of Electrical and Electronics Engineers (IEEE)
Discrete cosine transform (DCT) based orthogonal frequency division multiplexing (OFDM), which has double number of subcarrier compared to the classic discrete fourier transform (DFT) based OFDM (DFT-OFDM) at the same bandwidth, is a promising high spectral efficiency multicarrier techniques for future wireless communication. In this paper, an enhanced DCT-OFDM with index modulation (IM) (EDCT-OFDM-IM) is proposed to further exploit the benefits of the DCT-OFDM and IM techniques. To be more specific, a pre-filtering method based DCT-OFDM-IM transmitter is first designed and the non-linear maximum likelihood (ML) is developed for our EDCT-OFDM-IM system. Moreover, the average bit error probability (ABEP) of the proposed EDCT-OFDM-IM system is derived, which is confirmed by our simulation results. Both simulation and theoretical results are shown that the proposed EDCT-OFDM-IM system exhibits better bit error rate (BER) performance over the conventional DFT-OFDM-IM and DCT-OFDM-IM counterparts.
He Chang, Cao Aijun, Xiao Lixia, Zhang Lei, Xiao Pei, Nikitopoulos Konstantinos (2019) Index modulation assisted DCT-OFDM with Enhanced Transceiver Design,Proceedings of the 53rd IEEE International Conference on Communications (IEEE ICC 2019) Institute of Electrical and Electronics Engineers (IEEE)
An index modulation (IM) assisted Discrete Cosine Transform based Orthogonal Frequency Division Multiplexing (DCT-OFDM) with Enhanced Transmitter Design (termed as EDCT-OFDM-IM) is proposed. It amalgamates the concept of Discrete Cosine Transform assisted Orthogonal Frequency Division Multiplexing (DCT-OFDM) and Index Modulation (IM) to exploit the design freedom provided by the double number of available subcarrier under the same bandwidth. In the proposed EDCT-OFDM-IM scheme, the maximum likelihood (ML) detector used for symbol bits and index bits recovering is derived and the sophisticated designing guidelines for EDCTOFDM-IM are provided. Based on the derived pairwise error event probability, a theoretical upper bound on the average biterror probability (ABEP) of EDCT-OFDM-IM is provided over multipath fading channels. Furthermore, the maximum peak-toaverage power ratio (PAPR) of our proposed EDCT-OFDM-IM scheme is derived and compared to than the general Discrete Fourier Transform (DFT) based OFDM-IM counterpart.
Payami Sohail, Sellathurai Mathini, Nikitopoulos Konstantinos (2019) Low-Complexity Hybrid Beamforming for Massive MIMO Systems in Frequency-Selective Channels,IEEE Access7pp. 36195-36206 IEEE
Hybrid beamforming for frequency-selective channels is a challenging problem, as the phase shifters provide the same phase shift to all the subcarriers. The existing approaches solely rely on the channel?s frequency response, and the hybrid beamformers maximize the average spectral efficiency over the whole frequency band. Compared to state-of-the-art, we show that substantial sum-rate gains can be achieved, both for rich and sparse scattering channels, by jointly exploiting the frequency- and time-domain characteristics of the massive multiple-input multiple-output (MIMO) channels. In our proposed approach, the radio frequency (RF) beamformer coherently combines the received symbols in the time domain and, thus, it concentrates the signal?s power on a specific time sample. As a result, the RF beamformer flattens the frequency response of the ?effective? transmission channel and reduces its root-mean-square delay spread. Then, a baseband combiner mitigates the residual interference in the frequency domain. We present the closed-form expressions of the proposed beamformer and its performance by leveraging the favorable propagation condition of massive MIMO channels, and we prove that our proposed scheme can achieve the performance of fully digital zero-forcing when the number of employed phases shifter networks is twice the resolvable multipath components in the time domain.characteristics of the massive multiple-input multiple-output (MIMO) channels. In our proposed approach, the radio frequency (RF) beamformer coherently combines the received symbols in the time domain and, thus, it concentrates the signal's power on a specific time sample. As a result, the RF beamformer flattens the frequency response of the ``effective'' transmission channel and reduces its root-mean-square delay spread. Then, a baseband combiner mitigates the residual interference in the frequency domain. We present the closed-form expressions of the proposed beamformer and its performance by leveraging the favorable propagation condition of massive MIMO channels, and we prove that our proposed scheme can achieve the performance of fully digital zero-forcing when the number of employed phases shifter networks is twice the resolvable multipath components in the time domain.
Mehran F., Nikitopoulos K., Xiao P., Chen Q. (2015) Rateless wireless systems: gains, approaches, and challenges,2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) Proceedingspp. 751-755 IEEE
State-of-the-art channel coding schemes promise data rates close to the wireless channel capacity. However, efficient link adaptation techniques are required in order to deliver such throughputs in practice. Traditional rate adaptation schemes, which are reactive and try to ?predict? the transmission mode that maximizes throughput based on ?transmission quality indicators?, can be highly inefficient in an evolving wireless ecosystem where transmission can become increasingly dynamic and unpredictable. In such scenarios, ?rateless? link adaptation can be highly beneficial. Here, we compare popular rateless approaches in terms of gains and practicality in both traditional and more challenging operating scenarios. We also discuss challenges that need to be addressed to make such systems practical for future wireless communication systems.
Jayawardena Chathura, Nikitopoulos Konstantinos (2019) G-MultiSphere: Generalizing Massively Parallel Detection for Non-Orthogonal Signal Transmissions,IEEE Transactions on Communicationspp. 1-12 Institute of Electrical and Electronics Engineers (IEEE)
The increasing demand for connectivity and throughput, despite the spectrum limitations, has triggered a paradigm shift towards non-orthogonal signal transmissions. However, the complexity requirements of near-optimal detection methods for such systems becomes impractical, due to the large number of mutually interfering streams and to the rank-deficient or ill-determined nature of the corresponding interference matrix. This work introduces g-MultiSphere; a generic massively parallel and near-optimal sphere-decoding-based approach that, in contrast to prior work, applies to both well- and ill-determined non-orthogonal systems. We show that g-MultiSphere is the first approach that can support large uplink multi-user MIMO systems with numbers of concurrently transmitting users that exceed the number of receive antennas by a factor of two or more, while attaining throughput gains of up to 60% and with reduced complexity requirements in comparison to known approaches. By eliminating the need for sparse signal transmissions for nonorthogonal multiple access (NOMA) schemes, g-MultiSphere can support more users than existing systems with better detection performance and practical complexity requirements. In comparison to state- of-the-art detectors for NOMA schemes and nonorthogonal signal waveforms (e.g., SEFDM) g-MultiSphere can be up to an order of magnitude less complex, and can provide throughput gains of up to 60%.
Payami Sohail, Khalily Mohsen, Taheri Sohail, Nikitopoulos Konstantinos, Tafazolli Rahim (2020) Channel Measurement and Analysis for Polarimetric Wideband Outdoor Scenarios at 26 GHz: Directional vs Omni-Directional,EUCAP 2020
This paper presents the measurement results and analysis for outdoor wireless propagation channels at 26 GHz over 2 GHz bandwidth for two receiver antenna polarization modes. The angular and wideband properties of directional and virtually omni-directional channels, such as angular spread, root-mean-square delay spread and coherence bandwidth, are analyzed. The results indicate that the reflections can have a significant contribution in some realistic scenarios and increase the angular and delay spreads, and reduce the coherence bandwidth of the channel. The analysis in this paper also show that using a directional transmission can result in an almost frequencyflat fading channel over the measured 2 GHz bandwidth; which consequently has a major impact on the choice of system design choices such as beamforming and transmission numerology.
Georgis Georgios, Thanos Alexios, Filo Marcin, Nikitopoulos Konstantinos (2020) A DSP ACCELERATION FRAMEWORK FOR SOFTWARE-DEFINED RADIOS ON X86 64,ICASSP 2020
This paper presents a DSP acceleration and assessment framework targeting SDR platforms on x86 64 architectures. Driven by the potential of rapid prototyping and evaluation of breakthrough concepts that these platforms provide, our work builds upon the wellknown OpenAirInterface codebase, extending it for advanced, previously unsupported modes towards large and massive MIMO such as non-codebook-based multi-user transmissions. We then develop an acceleration/profiling framework, through which we present finegrained execution results for DSP operations. Incorporating the latest SIMD instructions, our acceleration framework achieves a unitary speedup of up to 10. Integrated into OpenAirInterface, it accelerates computationally expensive MIMO operations by up to 88% across tested modes. Besides resulting in a useful tool for the community, this work provides insight on runtime DSP complexity and the potential of modern x86 64 systems.
Georgis Georgios, Filo Marcin, Thanos Alexios, Husmann Christopher, De Luna Ducoing Juan Carlos, Tafazolli Rahim, Nikitopoulos Konstantinos (2019) SWORD: Towards a Soft and Open Radio Design for Rapid Development, Profiling, Validation and Testing,IEEE Access Institute of Electrical and Electronics Engineers
The vision, as we move to future wireless communication systems, embraces diverse qualities targeting significant enhancements from the spectrum, to user experience. Newly-defined air-interface features, such as large number of base station antennas and computationally complex physical layer approaches come with a non-trivial development effort, especially when scalability and flexibility need to be factored in. In addition, testing those features without commercial, off-the-shelf equipment has a high deployment, operational and maintenance cost. On one hand, industry-hardened solutions are inaccessible to the research community due to restrictive legal and financial licensing. On the other hand, researchgrade real-time solutions are either lacking versatility, modularity and a complete protocol stack, or, for those that are full-stack and modular, only the most elementary transmission modes are on offer (e.g., very low number of base station antennas). Aiming to address these shortcomings towards an ideal research platform, this paper presents SWORD, a SoftWare Open Radio Design that is flexible, open for research, low-cost, scalable and software-driven, able to support advanced large and massive Multiple-Input Multiple- Output (MIMO) approaches. Starting with just a single-input single-output air-interface and commercial off-the-shelf equipment, we create a software-intensive baseband platform that, together with an acceleration/ profiling framework, can serve as a research-grade base station for exploring advancements towards future wireless systems and beyond.
Payami Sohail, Khalily Mohsen, Loh Tian Hong, Nikitopoulos Konstantinos (2020) Hybrid Beamforming with Switches and Phase Shifters over Frequency-Selective Channels,IEEE Wireless Communications Letterspp. 1-1 Institute of Electrical and Electronics Engineers (IEEE)
The recent studies on hybrid beamformers with a combination of switches and phase shifters indicate that such methods can reduce the cost and power consumption of massive multiple-input multiple-output (MIMO) systems. However, most of the works have focused on the scenarios with frequency-flat channel models. This letter proposes an effective approach for such systems in frequency-selective channels and presents the closed-form expressions of the beamformer and the corresponding sum-rates. Compared to the traditional subconnected structures, our approach with a significantly smaller number of phase shifters results in a promising performance.
Mao Juquan, Zhang Lei, Xiao Pei, Nikitopoulos Konstantinos (2020) Filtered OFDM: An Insight into Intrinsic In-Band Interference and Filter Frequency Response Selectivity,IEEE Access8 Institute of Electrical and Electronics Engineers
The future mobile networks will face challenges in support of heterogeneous services over a unified physical layer, calling for a waveform with good frequency localization. Filtered orthogonal frequency division multiplexing (f-OFDM), as a representative subband filtered waveform, can be employed to improve the spectrum localization of orthogonal frequency-division multiplexing (OFDM) signal. However, the applied filtering operations will impact the performance in various aspects, especially for narrow subband cases. Unlike existing studies which mainly focus its benefits, this paper investigates two negative consequences inflicted on single subband f-OFDM systems: in-band interference and filter frequency response (FFR) selectivity. The exact-form expression for the in-band interference is derived, and the effect of FFR selectivity is analyzed for both single antenna and multiple antenna cases. The in-band interference-free and nearly-free conditions for f-OFDM systems are studied. A low-complexity blockwise parallel interference cancellation (BwPIC) algorithm and a pre-equalizer are proposed to tackle the two issues caused by the filtering operations, respectively. Numerical results show that narrower subbands suffer more performance degradation compared to wider bands. In addition, the proposed BwPIC algorithm effectively suppresses interference, and pre-equalized f-OFDM (pf-OFDM) considerably outperforms f- OFDM in both single antenna and multi-antenna systems.
The increasing demand for connectivity and throughput, combined with the tight latency requirements of current communication systems, and the existing spectrum limitations, has triggered a paradigm shift towards non-orthogonal signal transmissions where multiple information streams are transmitted using the same time/frequency resources. Despite the promising theoretical gains of such transmissions, the complexity and/or latency requirements of the corresponding receiver processing techniques that are required to translate these gains into throughput make their realization impractical, especially for large numbers of mutually interfering information streams. In addition to the processing complexity/latency increase related to systems? high dimensionality, the interference matrix of some of the recently proposed non-orthogonal transmission schemes can either be ill-determined or even rank-deficient, making their detection even more challenging. These requirements combined with the saturating speed of processors, motivates a timely requirement for massively parallel processing of such non-orthogonal transmissions. In this context, this thesis introduces a generic, massively parallel and near-optimal processing framework that applies to both well- and ill-determined non-orthogonal systems. In contrast to known approaches, this framework enables practical large uplink multi-user MIMO systems with numbers of concurrently transmitting users that exceed the number of receive antennas by a factor of two or more. In contrast to traditional approaches, the proposed framework, does not require sparse signal transmissions for the detection of non-orthogonal multiple access (NOMA) schemes since it is not based on the ?Message Passing? algorithm. Consequently, the proposed framework can enable more efficient NOMA approaches that support more users than existing systems, with better detection performance and practical complexity requirements. In comparison to state-of-the-art detectors for NOMA schemes and non-orthogonal signal waveforms(e.g., Spectrally efficient FDM) the proposed scheme can be up to an order of magnitude less complex and can provide throughput gains of up to 60%.This thesis also introduces a massively parallel soft-input soft-output (SISO) detection design for large MIMO systems capable of bridging the gap between theoretical capacity and achievable throughput, with a processing complexity that can be an order of magnitude lower than that of highly optimized sequential SISO detectors, and a processing latency similar to that of highly sub-optimal, linear, SISO detection approaches. Finally, a massively parallel processing framework is presented that enables extreme grant-free non-orthogonal multiple access. This framework allows reliable and low-overhead user identification and reliable detection/decoding with complexity requirements that can be orders of magnitude lower than existing schemes.
Payami Sohail, Khalily Mohsen, Araghi Ali, Hong Loh Tian, Cheadle David, Nikitopoulos Konstantinos, Tafazolli Rahim (2020) Developing the First mmWave Fully-Connected Hybrid Beamformer with a Large Antenna Array,IEEE Access Institute of Electrical and Electronics Engineers
Millimeter wave (mmWave) systems with effective beamforming capability play a key role in fulfilling the high data-rate demands of current and future wireless technologies. Hybrid analog-todigital beamformers have been identified as a cost-effective and energy-efficient solution towards deploying such systems. Most of the existing hybrid beamforming architectures rely on a subconnected phase shifter network with a large number of antennas. Such approaches, however, cannot fully exploit the advantages of large arrays. On the other hand, the current fully-connected beamformers accommodate only a small number of antennas, which substantially limits their beamforming capabilities. In this paper, we present a mmWave hybrid beamformer testbed with a fully-connected network of phase shifters and adjustable attenuators and a large number of antenna elements. To our knowledge, this is the first platform that connects two RF inputs from the baseband to a 16ý 8 antenna array, and it operates at 26 GHz with a 2 GHz bandwidth. It provides a wide scanning range of 60ý, and the flexibility to control both the phase and the amplitude of the signals between each of the RF chains and antennas. This beamforming platform can be used in both short and long-range communications with linear equivalent isotropically radiated power (EIRP) variation between 10 dBm and 60 dBm. In this paper, we present the design, calibration procedures and evaluations of such a complex system as well as discussions on the critical factors to consider for their practical implementation.