Konstantinos Nikitopoulos

Dr Konstantinos Nikitopoulos


Senior Lecturer (Associate Prof.) - Signal Processing for Wireless Communications
+44 (0)1483 683423
03 CII 01

Academic and research departments

Department of Electrical and Electronic Engineering.

Biography

Areas of specialism

Massively Parallel Algorithms for Communication Systems; Advanced Physical Layer Architectures; Testbeds and Trials; 5G Systems and Beyond

Affiliations and memberships

The Institute of Electrical and Electronics Engineers (IEEE)
Senior IEEE Member
UK’s Higher Education Academy
Fellow Member

Research

Research interests

Research projects

Supervision

Completed postgraduate research projects I have supervised

My teaching

My publications

Highlights

My recent research highlights include SWORD: a new SoftWare Open Radio Design that is flexible, open for research, low-cost, scalable and software-driven, and able to support advanced large and massive Multiple-Input Multiple-Output (MIMO) approaches, MultiSphere: the first method to massively parallelize the detection of large numbers of mutually interfering information streams, and g-MultiSphere: MultiSphere's generalization for application to non-orthogonal signal transmissions like Non-Orthogonal Multiple Access (NOMA) and Spectrally-Efficient Frequency Division Multiplexing (SE-FDM). The last are outcomes of the MultiSphere Project

An outline of my work is also included in “Massively Parallel and Flexible Signal Processing for large MIMO Systems” in the John Wiley & Sons in Wiley 5G REF: The Essential 5G Reference Online.

My scholarly contributions and academic indexes can be found here.

Publications

Nikitopoulos K, Zhang D, Lai I-W, Ascheid G (2009) Complexity-Efficient Enumeration Techniques for Soft-Input, Soft-Output Sphere Decoding, Nikitopoulos, K., Dan Zhang, I-wei Lai;, Ascheid, G. "Complexity-efficient enumeration techniques for soft-input, soft-output sphere decoding," IEEE Communications Letters, vol.14, no.4, pp.312-314, Apr. 2010
In this paper two complexity efficient soft sphere-decoder modifications are proposed for computing the max-log LLR values in iterative MIMO systems, which avoid the costly, typically needed, full enumeration and sorting (FES) procedure during the tree traversal without compromising the max-log performance. It is shown that despite the resulting increase in the number of expanded nodes, they can be more computationally efficient than the typical soft sphere decoders by avoiding the unnecessary complexity of FES.
Nikitopoulos K, Chatzipanagiotis D, Jayawardena C, Tafazolli R (2016) MultiSphere: Massively Parallel Tree Search for Large Sphere Decoders, IEEE GLOBECOM 2016 Proceedings
?This work introduces MultiSphere, a method to massively parallelize the tree search of large sphere decoders in a nearly-independent manner, without compromising their maximum-likelihood performance, and by keeping the overall processing complexity at the levels of highly-optimized sequential sphere decoders. MultiSphere employs a novel sphere decoder tree partitioning which can adjust to the transmission channel with a small latency overhead. It also utilizes a new method to distribute nodes to parallel sphere decoders and a new tree traversal and enumeration strategy which minimize redundant computations despite the nearly-independent parallel processing of the subtrees. For an 8 × 8 MIMO spatially multiplexed system with 16-QAM modulation and 32 processing elements MultiSphere can achieve a latency reduction of more than an order of magnitude, approaching the processing latency of linear detection methods, while its overall complexity can be even smaller than the complexity of well-known sequential sphere decoders. For 8×8 MIMO systems, MultiSphere?s sphere decoder tree partitioning method can achieve the processing latency of other partitioning schemes by using half of the processing elements. In addition, it is shown that for a multi-carrier system with 64 subcarriers, when performing sequential detection across subcarriers and using MultiSphere with 8 processing elements to parallelize detection, a smaller processing latency is achieved than when parallelizing the detection process by using a single processing element per subcarrier (64 in total).
Nikitopoulos K, Polydoros A (2005) Phase-impairment effects and compensation algorithms for OFDM systems, IEEE Transactions on Communications 53 (4) pp. 698-707
The simultaneous perturbation of an orthogonal frequency-division multiplexing receiver by phase noise plus a residual frequency offset (due to synchronization errors) is modeled here as a combined phase impairment, whose effect is evaluated analytically for the case of a frequency-selective fading channel. A nonpilot-aided (decision-directed) scheme is proposed, which compensates for the common (over all the subcarriers) phase-impairment effect. By representing the resulting intercarrier interference as an uncorrelated, unequal-variance process in the frequency domain, maximum-likelihood (ML) and approximate ML estimators of the complex-vector and phase-only types are derived and analytically evaluated. The present schemes are also compared with other current methods based on individual phase trackers, one per subcarrier. Finally, two suggestions are introduced for increasing the robustness of the algorithms to tentative-decision errors. It is demonstrated through simulations that the analysis is accurate, and that the proposed schemes achieve error-rate performance close to that of ideal compensation. © 2005 IEEE.
Georgis Georgios, Nikitopoulos Konstantinos, Jamieson K (2017) Geosphere: an Exact Depth-First Sphere Decoder Architecture Scalable to Very Dense Constellations, IEEE Access 5 pp. 4233-4249 IEEE
This paper presents the algorithmic design, experimental evaluation, and VLSI implementation of Geosphere, a depth-first sphere decoder able to provide the exact maximumlikelihood solution in dense (e.g., 64) and very dense (e.g., 256, 1024) QAM constellations by means of a geometrically inspired enumeration. In general, linear detection methods can be highly effective when the MIMO channel is well-conditioned. However, this is not the case when the size of the MIMO system increases and the number of transmit antennas approaches the number of the receive antennas. Via our WARP testbed implementation we gather indoor channel traces in order to evaluate the performance gains of sphere detection against zero-forcing and MMSE in an actual indoor environment. We show that Geosphere can nearly linearly scale performance with the number of user antennas; in 4 × 4 multi-user MIMO for 256-QAM modulation at 30 dB SNR there is a 1.7× gain over MMSE and 2.4× over zeroforcing and a 14% and 22% respective gain in 2 × 2 systems. In addition, by using a new node labeling based enumeration technique, low-complexity integer arithmetic and fine-grained clock gating, we implement for up to 1024-QAM constellations and compare in terms of area, delay, power characteristics, the Geosphere VLSI architecture and the best-known best-scalable exact ML sphere decoder. Results show that Geosphere is twice as area-efficient and 70% more energy efficient in 1024-QAM. Even for 16-QAM Geosphere is 13% more area efficient than the best-known implementation for 16-QAM and it is at least 80% more area efficient than state-of-the-art K-best detectors for 64-QAM.
Nikitopoulos K (2012) Maximum likelihood detection of spatially multiplexed signals via loosely ordered depth-first sphere decoding, Electronics Letters 48 (21) pp. 1368-1370
The complexity of depth-first sphere decoders (SDs) is determined by the employed tree search and pruning strategies. Proposed is a new SD approach for maximum-likelihood (ML) detection of spatially multiplexed, high-order, QAM symbols. In contrast to typical ML approaches, the proposed tree traversal skips the computationally intensive requirement of visiting the nodes in ascending order of their partial distances (PDs). Then, a new pruning method efficiently narrows the search space and preserves the ML performance despite the non-ordered tree traversal. This proposed approach results in substantially reduced PD calculations when compared to typical ML SDs and, for high SNRs, the necessary calculations can be reduced down to the number of transmit antennas. © 2012 The Institution of Engineering and Technology.
Zhang D, Lai I-W, Nikitopoulos K, Ascheid G (2010) Informed message update for iterative MIMO demapping and turbo decoding., ISITA pp. 873-878 IEEE
Nikitopoulos K, Zhou J, Congdon B, Jamieson K (2015) Geosphere: Consistently turning MIMO capacity into throughput, Computer Communication Review 44 (4) pp. 631-642
This paper presents the design and implementation of Geosphere, a physical- and link-layer design for access point-based MIMO wireless networks that consistently improves network throughput. To send multiple streams of data in a MIMO system, prior designs rely on a technique called zero-forcing, a way of "nulling" the interference between data streams by mathematically inverting the wireless channel matrix. In general, zero-forcing is highly effective, significantly improving throughput. But in certain physical situations, the MIMO channel matrix can become "poorly conditioned," harming performance. With these situations in mind, Geosphere uses sphere decoding, a more computationally demanding technique that can achieve higher throughput in such channels. To overcome the sphere decoder's computational complexity when sending dense wireless constellations at a high rate, Geosphere introduces search and pruning techniques that incorporate novel geometric reasoning about the wireless constellation. These techniques reduce computational complexity of 256-QAM systems by almost one order of magnitude, bringing computational demands in line with current 16- and 64-QAM systems already realized in ASIC. Geosphere thus makes the sphere decoder practical for the first time in a 4 × 4 MIMO, 256-QAM system. Results from our WARP testbed show that Geosphere achieves throughput gains over multi-user MIMO of 2× in 4 × 4 systems and 47% in 2 × 2 MIMO systems.
Liao C-H, Lai I-W, Nikitopoulos K, Borlenghi F, Kammler D, Witte EM, Zhang D, Chiueh T-D, Ascheid G, Meyr H (2009) Combining orthogonalized partial metrics: Efficient enumeration for soft-input sphere decoder., PIMRC pp. 1287-1291 IEEE
Zhang D, Nikitopoulos K, Lai I-W, Ascheid G, Meyr H (2010) Iterative channel estimation control for MIMO-OFDM Systems., CISS pp. 1-6 IEEE
Nikitopoulos K, Barghi S, Jafarkhani H, Yousefi'zadeh H (2012) Multi-user detection for asynchronous space-frequency block coded schemes in frequency selective environments., GLOBECOM pp. 4308-4313 IEEE
Nikitopoulos K, Ascheid G (2010) Complexity Adjusted Soft-Output Sphere Decoding by Adaptive LLR Clipping, CoRR abs/1011.2113
Nikitopoulos K, Ascheid G (2012) A simple complexity adjustment technique for soft MIMO receivers in broadcasting scenarios., ISWCS pp. 166-170 IEEE
Nikitopoulos K, Ascheid G (2011) Approximate MIMO Iterative Processing with Adjustable Complexity
Requirements,
IEEE Transactions on Vehicular Technology, vol. 61, no. 2, pp.
639-650, Feb. 2012
Targeting always the best achievable bit error rate (BER) performance in
iterative receivers operating over multiple-input multiple-output (MIMO)
channels may result in significant waste of resources, especially when the
achievable BER is orders of magnitude better than the target performance (e.g.,
under good channel conditions and at high signal-to-noise ratio (SNR)). In
contrast to the typical iterative schemes, a practical iterative decoding
framework that approximates the soft-information exchange is proposed which
allows reduced complexity sphere and channel decoding, adjustable to the
transmission conditions and the required bit error rate. With the proposed
approximate soft information exchange the performance of the exact soft
information can still be reached with significant complexity gains.
Nikitopoulos K, Ascheid G (2012) Approximate MIMO Iterative Processing With Adjustable Complexity Requirements., IEEE T. Vehicular Technology 61 2 pp. 639-650
Nikitopoulos K, Karachalios A, Reisis D (2014) Exact Max-Log MAP Soft-Output Sphere Decoding via Approximate Schnorr-Euchner Enumeration, IEEE Transactions on Vehicular Technology
Nikitopoulos K, Zhou J, Congdon B, Jamieson K (2014) Geosphere: Consistently turning MIMO capacity into throughput, SIGCOMM 2014 - Proceedings of the 2014 ACM Conference on Special Interest Group on Data Communication pp. 631-642
This paper presents the design and implementation of Geosphere, a physical- and link-layer design for access point-based MIMO wireless networks that consistently improves network throughput. To send multiple streams of data in a MIMO system, prior designs rely on a technique called zero-forcing, a way of "nulling" the interference between data streams by mathematically inverting the wireless channel matrix. In general, zero-forcing is highly effective, significantly improving throughput. But in certain physical situations, the MIMO channel matrix can become "poorly conditioned," harming performance. With these situations in mind, Geosphere uses sphere decoding, a more computationally demanding technique that can achieve higher throughput in such channels. To overcome the sphere decoder's computational complexity when sending dense wireless constellations at a high rate, Geosphere introduces search and pruning techniques that incorporate novel geometric reasoning about the wireless constellation. These techniques reduce computational complexity of 256-QAM systems by almost one order of magnitude, bringing computational demands in line with current 16- and 64-QAM systems already realized in ASIC. Geosphere thus makes the sphere decoder practical for the first time in a 4 x 4 MIMO, 256-QAM system. Results from our WARP testbed show that Geosphere achieves throughput gains over multi-user MIMO of 2x in 4 x 4 systems and 47% in 2 x 2 MIMO systems. © 2014 ACM.
Nikitopoulos K, Ascheid G (2010) Complexity Adjusted Soft-Output Sphere Decoding by Adaptive LLR Clipping, IEEE Communications Letters 15
A-posteriori probability (APP) receivers operating over multiple-input,
multiple-output channels provide enhanced bit error rate (BER) performance at
the cost of increased complexity. However, employing full APP processing over
favorable transmission environments, where less efficient approaches may
already provide the required performance at a reduced complexity, results in
unnecessary processing. For slowly varying channel statistics substantial
complexity savings can be achieved by simple adaptive schemes. Such schemes
track the BER performance and adjust the complexity of the soft output sphere
decoder by adaptively setting the related log-likelihood ratio (LLR) clipping
value.
Nikitopoulos K, Ascheid G (2010) MIMO APP Receiver Processing with Performance-Determined Complexity,
Typical receiver processing, targeting always the best achievable bit error
rate performance, can result in a waste of resources, especially, when the
transmission conditions are such that the best performance is orders of
magnitude better than the required. In this work, a processing framework is
proposed which allows adjusting the processing requirements to the transmission
conditions and the required bit error rate. It applies a-posteriori probability
receivers operating over multiple-input multiple-output channels. It is
demonstrated that significant complexity savings can be achieved both at the
soft, sphere-decoder based detector and the channel decoder with only minor
modifications.
Nikitopoulos K, Polydoros A (2007) Inter-Frame, Fine Frequency/Phase Synchronization forSimple Space-Time-Coded OFDM Receivers., IEEE Transactions on Wireless Communications 6 10 pp. 3510-3514
Nikitopoulos K, Stefanatos S, Katsaggelos AK (2009) Decision-aided compensation of severe phase-impairment-induced inter-carrier interference in frequency-selective OFDM., IEEE Transactions on Wireless Communications 8 4 pp. 1614-1619
Lai I-W, Liao C-H, Witte EM, Kammler D, Borlenghi F, Nikitopoulos K, Ramakrishnan V, Zhang D, Chiueh T-D, Ascheid G, Meyr H (2009) Searching in the Delta Lattice: An Efficient MIMO Detection for Iterative Receivers., GLOBECOM pp. 1-6 IEEE
Husmann C, Georgis G, Nikitopoulos K, Jamieson K (2017) FlexCore: Massively Parallel and Flexible Processing for Large MIMO Access Points, Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI ?17) pp. 197-211
Large MIMO base stations remain among wireless network designers? best tools for increasing wireless throughput while serving many clients, but current system designs, sacrifice throughput with simple linear MIMO detection algorithms. Higher-performance detection techniques are known, but remain off the table because these systems parallelize their computation at the level of a whole OFDM subcarrier, sufficing only for the lessdemanding linear detection approaches they opt for. This paper presents FlexCore, the first computational architecture capable of parallelizing the detection of large numbers of mutually-interfering information streams at a granularity below individual OFDM subcarriers, in a nearly-embarrassingly parallel manner while utilizing any number of available processing elements. For 12 clients sending 64-QAM symbols to a 12-antenna base station, our WARP testbed evaluation shows similar network throughput to the state-of-the-art while using an order of magnitude fewer processing elements. For the same scenario, our combined WARP-GPU testbed evaluation demonstrates a 19× computational speedup, with 97% increased energy efficiency when compared with the state of the art. Finally, for the same scenario, an FPGAbased comparison between FlexCore and the state of the art shows that FlexCore can achieve up to 96% better energy efficiency, and can offer up to 32× the processing throughput.
Nikitopoulos K, Mehran F, Jafarkhani H (2017) Space-Time Super-Modulation and its Application to Joint Medium Access and Rateless Transmission, IEEE GLOBECOM 2016 Conference Proceedings IEEE
The paper introduces the concept of Space-Time Super-Modulation according to which additional low rate and highly reliable information can be transmitted by further supermodulating blocks of traditionally modulated and space-time encoded information. This is achieved by exploiting the redundant information introduced by the space-time-block codes and, specifically, by efficiently mapping transmission patterns to specific information content. It is shown that Space-Time SuperModulation can be efficiently used in the context of MachineType-Communications to enable joint medium access and rateless data transmission while minimizing or even eliminating the need for transmitting preamble sequences. Compared with traditional approaches that use encoded preambles or preambles based on Zadoff-Chu sequences to transmit the signature information of transmitted packets, Space-Time Super-Modulation can achieve throughput gains of more than 45% when transmitting blocks of 200 symbols.
Mao J, Wang C, Zhang L, He C, Xiao P, Nikitopoulos K (2018) A DHT-based Multicarrier Modulation System with Pairwise ML Detection, Proceedings of the 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC) pp. 1-6 IEEE
This paper proposes a complex-valued discrete
multicarrier modulation (MCM) system based on
the real-valued discrete Hartley transform (DHT) and
its inverse (IDHT). Unlike conventional discrete Fourier
transform (DFT), DHT can not diagonalize the multipath
fading channel due to its inherent properties, which results
in the mutual interference between subcarriers in the
same mirror-symmetrical pair.We explore the interference
pattern in order to seek an optimal solution to utilize the
channel diversity for the purpose of enhancing system bit
error performance (BEP). It is shown that the optimal
channel diversity gain can be achieved via a pairwise
maximum likelihood (ML) detection, taking into account
not only the subcarrier?s own channel quality but also the
channel state of its mirror-symmetrical peer. Performance
analysis indicates that DHT-based MCM mitigates the fast
fading effect by averaging the channel power gain on the
mirror-symmetrical subcarriers. Simulation results show
that the proposed system has a substantial improvement
in BEP over conventional DFT-Based MCM.
Husmann C, Nikolaou P, Nikitopoulos K (2017) Reduced Latency ML Polar Decoding via Multiple Sphere-Decoding Tree Searches, IEEE Transactions on Vehicular Technology 67 (2) pp. 1835-1839 IEEE
Sphere decoding (SD) has been proposed as an
efficient way to perform maximum-likelihood (ML) decoding of
Polar codes. Its latency requirements, however, are determined
by its ability to promptly exclude from the ML search (i.e., prune)
large parts of the corresponding SD tree, without compromising
the ML optimality. Traditional depth-first approaches initially
find a ?promising" candidate solution and then prune parts of
the tree that cannot result to a ?better" solution. Still, if this
candidate solution is far (in terms of Euclidean distance) from
the ML one, pruning becomes inefficient and decoding latency
explodes. To reduce this processing latency, an early termination
approach is, first, introduced that exploits the binary nature of
the transmitted information. Then, a simple but very efficient
SD approach is proposed that performs multiple tree searches
that perform decreasingly aggressive pruning. These searches are
almost independent and can take place sequentially, in parallel, or
even in a hybrid (sequential/parallel) manner. For Polar codes of
128 block size, both realizations can provide a latency reduction
of up to four orders of magnitude compared to state-of-the-art
Polar sphere decoders. Then, a further 50% latency reduction
can be achieved by exploiting the parallel nature of the approach.
Nikitopoulos Konstantinos, Mehran Farhad, Jafarkhani H (2017) Space-Time Super-Modulation: Concept, Design Rules, and its Application to Joint Medium Access and Rateless Transmission, IEEE Transactions on Wireless Communications 16 (12) pp. 8275-8288 IEEE
We introduce the concept of Space-Time Super-Modulation according to which additional lowrate
and highly reliable information can be transmitted on top of traditionally modulated and spacetime
encoded information, without increasing the transmitted block length or degrading their error-rate
performance. This is achieved by exploiting the temporal redundancy introduced by the space-time block
codes and, specifically, by efficiently mapping transmission patterns to specific information content.
We show that Space-Time Super-Modulation can be efficiently used in the context of machine-type
communications to enable ?one-shot?, ?grant-free" joint medium access and rateless data transmission
while reducing or even eliminating the need for transmitting preamble sequences. As a result, compared
with traditional approaches that use correlatable preamble sequences or encoded preambles to transmit
the signature information of transmitted packets, Space-Time Super-Modulation can achieve significant
throughput gains. For example, we show up to 35% throughput gains from the second best examined
preamble-based scheme when transmitting blocks of 200 bits.
He Chang, Xiao Pei, Zhang Lei, Mao Juquan, Cao Aijun, Nikitopoulos Konstantinos (2017) Efficient DCT-MCM Detection for Single and Multi-Antenna Wireless Systems, Proceedings of the 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC) IEEE
The discrete cosine transform (DCT) based multicarrier
modulation (MCM) system is regarded as one of the
promising transmission techniques for future wireless communications.
By employing cosine basis as orthogonal functions
for multiplexing each real-valued symbol with symbol period
of T , it is able to maintain the subcarrier orthogonality while
reducing frequency spacing to 1/(2T ) Hz, which is only half
of that compared to discrete Fourier transform (DFT) based
multicarrier systems. In this paper, following one of the effective
transmission models by which zeros are inserted as guard
sequence and the DCT operation at the receiver is replaced
by DFT of double length, we reformulate and evaluate three
classic detection methods by appropriately processing the post-
DFT signals both for single antenna and multiple-input multipleoutput
(MIMO) DCT-MCM systems. In all cases, we show that
with our reformulated detection approaches, DCT-MCM schemes
can outperform, in terms of error-rate, conventional OFDMbased
systems.
Mehran Farhad, Nikitopoulos Konstantinos (2018) Generalized Space-Time Super-Modulation for Headerless Grant-Free Rateless Multiple Access, Proceedings of the IEEE Global Communications Conference 2018 Institute of Electrical and Electronics Engineers (IEEE)
This work introduces Generalized Space-Time
Super-Modulation (GSTSM), a generalization of the recently
proposed Space-Time Super-Modulation scheme that enables the
transmission of additional, highly-reliable information on the top
of conventionally transmitted symbols, without increasing the
corresponding packet length. GSTSM jointly exploits the spatial
and temporal dimensions of multiple-antenna systems but, in
contrast to the initially proposed approach, it does not require the
use of space-time block codes. Instead, GSTSM jointly elaborates
on the concepts of spatial modulation and spatial diversity, while
intentionally introducing temporal correlation to the transmitted
symbol sequence. In the context of machine-type communications,
GSTSM enables one-shot and grant-free medium access without
transmitting additional headers to convey each machine?s ID. As
a result, we show that GSTSM can provide throughput gains
of up to 2.5 X compared to conventional header-based schemes,
even in the case of colliding packets.
Husmann Christopher Camilo Mischa, Nikitopoulos Konstantinos (2018) ViPer MIMO: Increasing Large MIMO Efficiency via Practical Vector-Perturbation, Proceedings of IEEE Globecom2018 WC IEEE
Large multi-user MIMO systems with spatial multiplexing
are among the most promising approaches for increasing
wireless throughput while serving many clients. Yet, the achievable
spectral efficiency of current large MIMO systems is limited
by the adoption of simple, but sub-optimal, linear precoding
techniques (e.g, minimum-mean-square-error (MMSE)). Nonlinear
precoding methods, like Vector Perturbation (VP), claim to
be able to provide improved network throughput. However, such
methods are still purely theoretical and they do not account for
the practical aspects of actual wireless systems, as the corresponding
complexity and latency requirements, or the need for practical
rate adaptation. This paper presents ViPer, the first practical
VP-based MIMO system design. ViPer substantially reduces the
latency requirements of VP by employing massively parallel
processing and realizes a practical rate adaptation method that
efficiently translates VP?s signal-to-noise-ratio (SNR) gains into
actual throughput gains. In our first systematic experimental
evaluation of VP-based precoders, we show that ViPer can
deliver in practice up to 30% higher throughput than MMSE
precoding with comparable latency requirements. In addition,
ViPer can match the performance of state-of-the-art parallel
VP precoding schemes, by utilizing less than one tenth of the
processing elements.
Husmann Christopher, Tafazolli Rahim, Nikitopoulos Konstantinos Antipodal Detection and Decoding for Large Multi-User MIMO with Reduced Base-Station Antennas, Proceedings of the IEEE GLOBECOM 2018 Workshops Institute of Electrical and Electronics Engineers (IEEE)
To avoid unnecessarily using a massive number of
base station antennas to support a large number of users spatially
multiplexed multi-user MIMO systems, optimal detection
methods are required to demultiplex the mutually interfering
information streams. Sphere decoding (SD) can achieve this,
but its complexity and latency becomes impractical for large
MIMO systems. Low complexity detection solutions such as linear
detectors (e.g., MMSE) or likelihood ascendant search (LAS)
approaches, have significantly lower latency requirements than
SD but their achievable throughput is far from optimal. This
work presents the concept of Antipodal detection and decoding,
that can deliver very high throughput with practical latency
requirements, even in systems where the number of user antennas
reaches the number of base station antennas. The Antipodal
detector either results in a highly reliable vector solution, or it
does not find a vector solution at all (i.e., it results in an erasure),
skipping the heavy processing load related to finding vector
solutions that have a very high likelihood to be erroneous. Then,
a belief-propagation-based decoder is proposed, that restores
these erasures and further corrects remaining erroneous vector
solutions. We show that for 32å32, 64-QAM modulated systems,
and for packet error rates below 10%, Antipodal detection and
decoding requires 9 dB less transmitted power than systems
employing soft MMSE or LAS detection and LDPC decoding
with similar complexity requirements. For the same scenario, our
Antipodal method achieves practical throughput gains of more
than 50% compared to soft MMSE and soft LAS-based methods.
Nikitopoulos Konstantinos, Georgis Georgios, Jayawardena Chathura, Chatzipanagiotis Daniil, Tafazolli Rahim (2018) Massively Parallel Tree Search for High-Dimensional Sphere Decoders, Transactions on Parallel and Distributed Systems IEEE
The recent paradigm shift towards the transmission of large numbers of mutually interfering information streams, as in the
case of aggressive spatial multiplexing, combined with requirements towards very low processing latency despite the frequency
plateauing of traditional processors, initiates a need to revisit the fundamental maximum-likelihood (ML) and, consequently, the
sphere-decoding (SD) detection problem. This work presents the design and VLSI architecture of MultiSphere; the first method to
massively parallelize the tree search of large sphere decoders in a nearly-concurrent manner, without compromising their
maximum-likelihood performance, and by keeping the overall processing complexity comparable to that of highly-optimized sequential
sphere decoders. For a 10 å 10 MIMO spatially multiplexed system with 16-QAM modulation and 32 processing elements, our
MultiSphere architecture can reduce latency by 29å against well-known sequential SDs, approaching the processing latency of linear
detection methods, without compromising ML optimality. In MIMO multicarrier systems targeting exact ML decoding, MultiSphere
achieves processing latency and hardware efficiency that are orders of magnitude improved compared to approaches employing one
SD per subcarrier. In addition, for 16å16 both ?hard?- and ?soft?-output MIMO systems, approximate MultiSphere versions are shown to
achieve similar error rate performance with state-of-the art approximate SDs having akin parallelization properties, by using only one
tenth of the processing elements, and to achieve up to approximately 9å increased energy efficiency.
Jayawardena Chathura, Nikitopoulos Konstantinos Massively Parallel Detection for Non-Orthogonal Signal Transmissions, Proceedings of the IEEE GLOBECOM 2018 Workshops Institute of Electrical and Electronics Engineers (IEEE)
The increasing demand for massive connectivity
with low latency requirements has triggered a paradigm shift
towards Non-Orthogonal transmissions. Still, to translate the
theoretical gains of Non-Orthogonal transmissions into practical,
efficient ?soft? detection schemes are required. The detection
latency and/or complexity of state-of-the-art detection methods
becomes impractical for large Non-Orthogonal systems, both
due to the large number of interfering streams and due to the
rank-deficient or ill-determined nature of the corresponding interference
matrix. Extending the recently proposed MultiSphere
framework, this work introduces NorthCore; a massively parallel
sphere-decoding-based scheme for the detection of large and illdetermined
Non-Orthogonal systems. Similarly to MultiSphere,
NorthCore reduces the corresponding search space by focusing
the available processing power to the most promising vector
solutions that are processed in parallel. As a result, the proposed
detection scheme can attain a detection processing latency similar
to that of highly-suboptimal linear detectors and even outperform
state-of-the-art sophisticated detection approaches with up to
an order of magnitude reduced complexity. To identify the
most promising vector solutions, NorthCore introduces a sortfree
candidate selection technique that reduces the necessary
preprocessing complexity by up to an order of magnitude, making
the proposed approach practical.
He Chang, Cao Aijun, Xiao Lixia, Zhang Lei, Xiao Pei, Nikitopoulos Konstantinos (2019) Enhanced DCT-OFDM System With Index Modulation, IEEE Transactions on Vehicular Technology Institute of Electrical and Electronics Engineers (IEEE)
Discrete cosine transform (DCT) based orthogonal
frequency division multiplexing (OFDM), which has double number
of subcarrier compared to the classic discrete fourier transform
(DFT) based OFDM (DFT-OFDM) at the same bandwidth,
is a promising high spectral efficiency multicarrier techniques
for future wireless communication. In this paper, an enhanced
DCT-OFDM with index modulation (IM) (EDCT-OFDM-IM) is
proposed to further exploit the benefits of the DCT-OFDM and
IM techniques. To be more specific, a pre-filtering method based
DCT-OFDM-IM transmitter is first designed and the non-linear
maximum likelihood (ML) is developed for our EDCT-OFDM-IM
system. Moreover, the average bit error probability (ABEP) of the
proposed EDCT-OFDM-IM system is derived, which is confirmed
by our simulation results. Both simulation and theoretical results
are shown that the proposed EDCT-OFDM-IM system exhibits
better bit error rate (BER) performance over the conventional
DFT-OFDM-IM and DCT-OFDM-IM counterparts.
He Chang, Cao Aijun, Xiao Lixia, Zhang Lei, Xiao Pei, Nikitopoulos Konstantinos (2019) Index modulation assisted DCT-OFDM with Enhanced Transceiver Design, Proceedings of the 53rd IEEE International Conference on Communications (IEEE ICC 2019) Institute of Electrical and Electronics Engineers (IEEE)
An index modulation (IM) assisted Discrete Cosine
Transform based Orthogonal Frequency Division Multiplexing
(DCT-OFDM) with Enhanced Transmitter Design (termed as
EDCT-OFDM-IM) is proposed. It amalgamates the concept
of Discrete Cosine Transform assisted Orthogonal Frequency
Division Multiplexing (DCT-OFDM) and Index Modulation (IM)
to exploit the design freedom provided by the double number
of available subcarrier under the same bandwidth. In the
proposed EDCT-OFDM-IM scheme, the maximum likelihood
(ML) detector used for symbol bits and index bits recovering
is derived and the sophisticated designing guidelines for EDCTOFDM-IM are provided. Based on the derived pairwise error
event probability, a theoretical upper bound on the average biterror probability (ABEP) of EDCT-OFDM-IM is provided over
multipath fading channels. Furthermore, the maximum peak-toaverage power ratio (PAPR) of our proposed EDCT-OFDM-IM
scheme is derived and compared to than the general Discrete
Fourier Transform (DFT) based OFDM-IM counterpart.
Payami Sohail, Sellathurai Mathini, Nikitopoulos Konstantinos (2019) Low-Complexity Hybrid Beamforming for Massive MIMO Systems in Frequency-Selective Channels, IEEE Access 7 pp. 36195-36206 IEEE
Hybrid beamforming for frequency-selective channels is a challenging problem, as the phase shifters provide the same phase shift to all the subcarriers. The existing approaches solely rely on the channel?s frequency response, and the hybrid beamformers maximize the average spectral efficiency over the whole frequency band. Compared to state-of-the-art, we show that substantial sum-rate gains can be achieved, both for rich and sparse scattering channels, by jointly exploiting the frequency- and time-domain characteristics of the massive multiple-input multiple-output (MIMO) channels. In our proposed approach, the radio frequency (RF) beamformer coherently combines the received symbols in the time domain and, thus, it concentrates the signal?s power on a specific time sample. As a result, the RF beamformer flattens the frequency response of the ?effective? transmission channel and reduces its root-mean-square delay spread. Then, a baseband combiner mitigates the residual interference in the frequency domain. We present the closed-form expressions of the proposed beamformer and its performance by leveraging the favorable propagation condition of massive MIMO channels, and we prove that our proposed scheme can achieve the performance of fully digital zero-forcing when the number of employed phases shifter networks is twice the resolvable multipath components in the time domain.characteristics of the massive multiple-input multiple-output (MIMO) channels. In our proposed approach,
the radio frequency (RF) beamformer coherently combines the received symbols in the time domain and,
thus, it concentrates the signal's power on a specific time sample. As a result, the RF beamformer flattens
the frequency response of the ``effective'' transmission channel and reduces its root-mean-square delay
spread. Then, a baseband combiner mitigates the residual interference in the frequency domain. We present
the closed-form expressions of the proposed beamformer and its performance by leveraging the favorable
propagation condition of massive MIMO channels, and we prove that our proposed scheme can achieve the
performance of fully digital zero-forcing when the number of employed phases shifter networks is twice the
resolvable multipath components in the time domain.
Mehran F., Nikitopoulos K., Xiao P., Chen Q. (2015) Rateless wireless systems: gains, approaches, and challenges, 2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) Proceedings pp. 751-755 IEEE
State-of-the-art channel coding schemes promise data rates close to the wireless channel capacity. However, efficient link adaptation techniques are required in order to deliver such throughputs in practice. Traditional rate adaptation schemes, which are reactive and try to ?predict? the transmission mode that maximizes throughput based on ?transmission quality indicators?, can be highly inefficient in an evolving wireless ecosystem where transmission can become increasingly dynamic and unpredictable. In such scenarios, ?rateless? link adaptation can be highly beneficial. Here, we compare popular rateless approaches in terms of gains and practicality in both traditional and more
challenging operating scenarios. We also discuss challenges that need to be addressed to make such systems practical for future wireless communication systems.
Jayawardena Chathura, Nikitopoulos Konstantinos (2019) G-MultiSphere: Generalizing Massively Parallel Detection for Non-Orthogonal Signal Transmissions, IEEE Transactions on Communications pp. 1-12 Institute of Electrical and Electronics Engineers (IEEE)
The increasing demand for connectivity and throughput, despite the spectrum limitations, has triggered a paradigm shift towards non-orthogonal signal transmissions. However, the complexity requirements of near-optimal detection methods for such systems becomes impractical, due to the large number of mutually interfering streams and to the rank-deficient or ill-determined nature of the corresponding interference matrix. This work introduces g-MultiSphere; a generic massively parallel and near-optimal sphere-decoding-based approach that, in contrast to prior work, applies to both well- and ill-determined non-orthogonal systems. We show that g-MultiSphere is the first approach that can support large uplink multi-user MIMO systems with numbers of concurrently transmitting users that exceed the number of receive antennas by a factor of two or more, while attaining throughput gains of up to 60% and with reduced complexity requirements in comparison to known approaches. By eliminating the need for sparse signal transmissions for nonorthogonal multiple access (NOMA) schemes, g-MultiSphere can support more users than existing systems with better detection performance and practical complexity requirements. In comparison to state- of-the-art detectors for NOMA schemes and nonorthogonal signal waveforms (e.g., SEFDM) g-MultiSphere can be up to an order of magnitude less complex, and can provide throughput gains of up to 60%.
Payami Sohail, Khalily Mohsen, Taheri Sohail, Nikitopoulos Konstantinos, Tafazolli Rahim (2020) Channel Measurement and Analysis for Polarimetric
Wideband Outdoor Scenarios at 26 GHz:
Directional vs Omni-Directional,
EUCAP 2020
This paper presents the measurement results and
analysis for outdoor wireless propagation channels at 26 GHz
over 2 GHz bandwidth for two receiver antenna polarization
modes. The angular and wideband properties of directional
and virtually omni-directional channels, such as angular spread,
root-mean-square delay spread and coherence bandwidth, are
analyzed. The results indicate that the reflections can have a significant
contribution in some realistic scenarios and increase the
angular and delay spreads, and reduce the coherence bandwidth
of the channel. The analysis in this paper also show that using
a directional transmission can result in an almost frequencyflat
fading channel over the measured 2 GHz bandwidth; which
consequently has a major impact on the choice of system design
choices such as beamforming and transmission numerology.
Georgis Georgios, Thanos Alexios, Filo Marcin, Nikitopoulos Konstantinos (2020) A DSP ACCELERATION FRAMEWORK FOR SOFTWARE-DEFINED RADIOS ON X86 64, ICASSP 2020
This paper presents a DSP acceleration and assessment framework targeting SDR platforms on x86 64 architectures. Driven by the
potential of rapid prototyping and evaluation of breakthrough concepts that these platforms provide, our work builds upon the wellknown
OpenAirInterface codebase, extending it for advanced, previously unsupported modes towards large and massive MIMO such as non-codebook-based multi-user transmissions. We then develop an acceleration/profiling framework, through which we present finegrained
execution results for DSP operations. Incorporating the latest SIMD instructions, our acceleration framework achieves a unitary speedup of up to 10. Integrated into OpenAirInterface, it accelerates computationally expensive MIMO operations by up to 88% across tested modes. Besides resulting in a useful tool for the community, this work provides insight on runtime DSP complexity and the potential of modern x86 64 systems.
Georgis Georgios, Filo Marcin, Thanos Alexios, Husmann Christopher, De Luna Ducoing Juan Carlos, Tafazolli Rahim, Nikitopoulos Konstantinos (2019) SWORD: Towards a Soft and Open Radio Design
for Rapid Development, Profiling,
Validation and Testing,
IEEE Access Institute of Electrical and Electronics Engineers
The vision, as we move to future wireless communication systems, embraces diverse qualities
targeting significant enhancements from the spectrum, to user experience. Newly-defined air-interface
features, such as large number of base station antennas and computationally complex physical layer
approaches come with a non-trivial development effort, especially when scalability and flexibility need to
be factored in. In addition, testing those features without commercial, off-the-shelf equipment has a high
deployment, operational and maintenance cost. On one hand, industry-hardened solutions are inaccessible
to the research community due to restrictive legal and financial licensing. On the other hand, researchgrade
real-time solutions are either lacking versatility, modularity and a complete protocol stack, or, for
those that are full-stack and modular, only the most elementary transmission modes are on offer (e.g., very
low number of base station antennas). Aiming to address these shortcomings towards an ideal research
platform, this paper presents SWORD, a SoftWare Open Radio Design that is flexible, open for research,
low-cost, scalable and software-driven, able to support advanced large and massive Multiple-Input Multiple-
Output (MIMO) approaches. Starting with just a single-input single-output air-interface and commercial
off-the-shelf equipment, we create a software-intensive baseband platform that, together with an acceleration/
profiling framework, can serve as a research-grade base station for exploring advancements towards
future wireless systems and beyond.
Payami Sohail, Khalily Mohsen, Loh Tian Hong, Nikitopoulos Konstantinos (2020) Hybrid Beamforming with Switches and Phase Shifters over Frequency-Selective Channels, IEEE Wireless Communications Letters pp. 1-1 Institute of Electrical and Electronics Engineers (IEEE)
The recent studies on hybrid beamformers with a combination of switches and phase shifters indicate that such methods can reduce the cost and power consumption of massive multiple-input multiple-output (MIMO) systems. However, most of the works have focused on the scenarios with frequency-flat channel models. This letter proposes an effective approach for such systems in frequency-selective channels and presents the closed-form expressions of the beamformer and the corresponding sum-rates. Compared to the traditional subconnected structures, our approach with a significantly smaller number of phase shifters results in a promising performance.
Mao Juquan, Zhang Lei, Xiao Pei, Nikitopoulos Konstantinos (2020) Filtered OFDM: An Insight into Intrinsic
In-Band Interference and Filter
Frequency Response Selectivity,
IEEE Access 8 Institute of Electrical and Electronics Engineers
The future mobile networks will face challenges in support of heterogeneous services over
a unified physical layer, calling for a waveform with good frequency localization. Filtered orthogonal
frequency division multiplexing (f-OFDM), as a representative subband filtered waveform, can be employed
to improve the spectrum localization of orthogonal frequency-division multiplexing (OFDM) signal.
However, the applied filtering operations will impact the performance in various aspects, especially for
narrow subband cases. Unlike existing studies which mainly focus its benefits, this paper investigates
two negative consequences inflicted on single subband f-OFDM systems: in-band interference and filter
frequency response (FFR) selectivity. The exact-form expression for the in-band interference is derived, and
the effect of FFR selectivity is analyzed for both single antenna and multiple antenna cases. The in-band
interference-free and nearly-free conditions for f-OFDM systems are studied. A low-complexity blockwise
parallel interference cancellation (BwPIC) algorithm and a pre-equalizer are proposed to tackle the
two issues caused by the filtering operations, respectively. Numerical results show that narrower subbands
suffer more performance degradation compared to wider bands. In addition, the proposed BwPIC algorithm
effectively suppresses interference, and pre-equalized f-OFDM (pf-OFDM) considerably outperforms f-
OFDM in both single antenna and multi-antenna systems.