Download presentation
Presentation is loading. Please wait.
Published bySharleen Cox Modified over 9 years ago
1
Calculating Discrete Logarithms John Hawley Nicolette Nicolosi Ryan Rivard
2
Discrete Logarithms We want to find a unique integer x such that α x = β (mod n). We can find x by solving: x = log α β (mod n). But…
3
Logarithms are easy! Logarithms in real numbers are easy to calculate, partially because the log function is continuous and monotonically increasing. Discrete Logarithms don’t have either of these properties. For example, in a (mod 5) system, the powers of 2 are 1, 2, 4, 3. This wraparound makes the discrete log function significantly harder to compute than the ordinary log function.
4
Definitions Multiplicative Group – A set of congruence classes that are relatively prime to the modulus. We used the group Z p, where the modulus is a prime number and the group is cyclic (the values repeat).
5
Definitions Order of a Group – The number of elements in a group, which can be found using Euler’s totient function – For Z p, this is p − 1 – For Z p k, it is (p − 1) p k − 1 Generators and Primitive Elements – An element that produces the other elements of the group when raised to various powers. Primitive elements are also generators.
6
The Problem We have a multiplicative group (G, *), α is a generator of G having order n, and β is an element generated by α. Remember, we want to find a unique integer x such that α x = β (mod n), by solving x = log α β (mod n).
7
The Problem Computing α x = β for a given x is simple and efficient using the square-and- multiply algorithm for exponentiation. Computing a = log α β is difficult and can consume a large amount of time and memory for large values, such as those used in cryptography.
8
The Problem This property makes discrete logs ideal for cryptographic applications because one function is easy, but the inverse function is difficult. There is a class of public-key cryptosystems that use the discrete logarithm problem for key generation and encryption/decryption.
9
The El Gamal Cryptosystem p is a prime number chosen so that the Discrete Logarithm problem is infeasible in (Z p *, *). α is a generator in Z p *. β is computed as follows: β = α x mod p The public key consists of p, α, and β. The private key is x.
10
Attacking El Gamal An attacker wants to obtain the private key x so that they can encrypt and decrypt messages as the user. An attack using the Discrete Logarithm problem would attempt to compute x = log α β (mod p). Given that α, β, and p are part of the public key, this attack does not require intercepting any data other than the public key.
11
The Algorithm By rewriting α x = β in the form β(α −m ) i = α j, where x = im + j, we can precompute values of α j for several values of j, storing them in a hash table. The highest j computed can be used for m and values of i can be iteratively tried on the left-hand side of the equation.
12
The Algorithm The running time is O(m + n/m), where m is the number of entries in the hash table. Optimal running time when m = sqrt(n), but this might not be feasible based on the amount of available memory. For example, a 512-bit n would require 2 256 hash table entries for optimal performance, but even at a single byte per entry (a significant underestimate), this is over 10 65 Terabytes of data.
13
Pseudocode BABY-STEP-GIANT-STEP( α, β, m, n) FOR j IN [0, m] HASH-TABLE-PUT( α j (mod n), j) Compute α −m (mod n) γ ← β FOR i IN [0, ceil(n/m)] IF HASH-TABLE-CONTAINS-KEY( γ ) j ← HASH-TABLE-GET( γ ) RETURN (i * m) + j ELSE γ ← γ ∗ α −m (mod n)
14
Benefits of Parallelization Being able to compute Discrete Logarithms faster for large values could make cryptosystems that use this problem vulnerable to attack. Discovering this could result in more secure encryption of confidential data using systems that are not vulnerable to this attack.
15
Implementation The hash table can be distributed in order to take advantage of total memory capacity of a cluster. Each node can independently build a portion of the hash table. A barrier can be used to ensure that all nodes have completed this first stage before continuing. Once the hash table has been generated, each node can perform a subset of the collision checks independently. Thus, the only communication ever required between nodes is to handle the distributed hash table inserts/lookups.
16
Implementation Drawbacks Whenever a node needs to look up a hash table key that is stored on another node, network latency becomes a factor. While hash tables for very large numbers will quickly exhaust available memory, numbers that are reasonably sized for running tests on the Paranoia cluster do not even come close to this limit. Better performance may be seen on SMP machines, but the number of processors available is limited. Hybrid SMP/Cluster parallel systems might be an effective compromise.
17
Alternative Ideas Since the hash tables for the problems we are solving are small enough to fit on a single node, we could fill the table in a distributed manner, and all-gather the resulting hash table. Each node would then have a local copy of the table and network latency would be eliminated for hash table lookups. Each node could perform only hash-put and hash-get requests on its local table, and could simply ignore requests for other nodes. In this way, the nodes would all be testing the same values, but would still benefit from the increased memory of a cluster over a single machine.
18
Results Initial increase of processors from 1 to 2 showed a large slowdown. Large number of sends / receives Decent speedup from 2 to 32 processors
19
Results Limited time for testing does not allow us to use the algorithm’s full potential Algorithm is based on a trade off of memory and time Optimal size for hash table is sqrt(n) entries On small 32 bit numbers, memory usage is only around 256kb
20
Results Any recent cryptosystem will use at least 128bit numbers. To use the optimal sqrt(n) entries, this would require approximately 268,435,456 Terabytes of memory Clearly an infeasible amount to use the optimally sized hash table on a single (or multiple) machines.
21
Results Distributed memory allows us to use a larger hash map, thus reducing the runtime of the algorithm Results expected to surpass linear speedup, due to an improvement in the asymptotic running time.
22
Graph - 8-bit Input
23
Graph - 16-bit Input
24
Graph - 32-bit Input
25
Graph - 8-bit and 16-bit Inputs
26
Conclusions The speedup observed from 2 to 32 processors points towards MPI overhead as major cause for initial slowdown. On small data, shared memory would have been a better choice Still can use optimally sized hash map Small overhead, little synchronization needed
27
Questions?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.