Clustering Cryptocurrencies
1. Introduction
Why is clustering interesting? How to value cryptocurrencies has been a major question ever since so many began finding their way to market. As Qunitero (2018) points out, “having a clear and unbiased benchmark while evaluating new decentralized projects in the crypto economy†could help to answer the question of valuation. Clustering commonly occurs around token type: thus, one routinely sees the clustering of currency tokens, platform tokens, utility tokens, brand tokens, and security tokens. Yet these are not the only clusters that may appear, the more closely one looks at the space. As clustering shows which cryptocurrencies move in tandem at the top of the market cap, it is useful to examine clustering cryptocurrencies to see what similarities in movement might tell us.
Are fundamental similarities backed by market metrics? That is the main question to be asked and an important one because clusters can be used to formulate trading strategies. However, Qunitero (2018) notes that there is more than one cluster in the cryptocurrency space—in fact, there are numerous ones. Identifying them and understanding the relationship among assets is critical to devising a successful trading strategy. Identifying clusters as part of developing a trading strategy for cryptocurrencies could help make the space far more viable for investors and speculators alike. “There do seem to exist natural clusters of coins that move in tandem,†Quintero (2018) states—which means more cryptocurrency samples need to be examined in order to clarify the seeming relationships.
2. Method and Results
Part I: Developing a Method
The problem of time series clustering can be considered as finding a function:
$$f(X_T) = y \\in [1…K]$$$$\\text{for }X_T=(x_1, …, x_T)$$$$\\text{with }x_T \\in\\mathbb{R^d}$$
where T is timeline length and K is particular cluster. This should be conducted with representation of time series as a set of selected features vi of fixed size D independent of T.
With this representation, applying standard clustering algorithms on this feature set can be possible. The main question is what features to consider when applying the algorithm? For the purpose of this study, we identified multiple time series describing each coin and we also constructed derivative parameters to define these series.
Next, we devised a method of moving from simple to complex in terms of identifying clusters:
1. We used common, standard features for each series (parameter): Means, Medians, Standard deviations, Skewness, and Kurtosis.
2. We used tsfresh library to automate the process of features extraction.
3. We applied both approaches to series fragmented by state of BTC.
DBSCAN
It was important to identify a clustering method that could be applied quickly to facilitate trading and allow easy scaling. The clustering method selected, therefore, was DBSCAN, one of the most universal and applicable algorithms available today. The DBSCAN algorithm views clusters as areas of high density separated by areas of low density. Due to this simple if generic function, clusters found by DBSCAN can take any shape, as opposed to the k-means method, which assumes that clusters are convex shaped. As the purpose of this study was to identify clusters without applying presupposed views of what they should look like, the k-means method was inappropriate and DBSCAN, with its basic approach to recognizing clusters, fit much more effectively. This is why the data obtained in this study is quite rarified.
The central component to the DBSCAN is the concept of core samples, which are samples that occur in areas of high density. A cluster is therefore a set of core samples, each close to one another (measured by a distance measure) and a set of non-core samples that are close to a core sample (but are not themselves core samples). There are two parameters to the algorithm, min_samples and eps, which define formally what is meant by density. Higher min_samples or lower eps indicate higher density necessary to form a cluster.
The additional advantage of DBSCAN is the calculation of an estimated number of clusters that it permits. Using DBSCAN, top-level clusters could be obtained using data presence across all given coins. The Extractor function of basic features was applied to each large cluster identified. Clustering then became possible inside top-level clusters following the application of this function.
This quite basic method of features extraction for the development of coin profiles can be scaled: for example, features can be extracted for different periods of time, forming wider sets of features for each coin. Alternatively, a measure of similarity can be found for different periods and compiled in unified metrics across all periods. In short, clustering can be conducted across multiple variables as inputs. The next step in the process was to perform clustering relying on extracted features and additionally to use tsfresh library as an alternative approach.
Part II: Clustering Inside Top-Level Clusters
To accomplish clustering inside top-level clusters, which were identified using DBSCAN, Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) was used to perform the DBSCAN over varying epsilon values and to integrate the results to find a clustering that gives the best stability over epsilon. HDBSCAN allowed us to find clusters of varying densities (unlike DBSCAN), and be more robust in terms of parameter selection.
The limitations of this approach are that it is highly general. It does not detect mutual dependencies, following trends or other complex arrangement. Essentially, it is clustering by basic time series characteristics. Top-level clustering is a rather necessary measure and the list of features describing each time series is short, but it can easily be extended with additional features.
HDBSCAN (as an extension of DBSCAN method) groups together points that are closely packed together (points with many nearby neighbors), marking as outliers points that lie alone in low-density regions (whose nearest neighbors are too far away). This allows “unclustered coins†to appear: they are outliers categorized under the -1 label. This group consists of two types of coins: coins with a large amount of data that is not collected and actual outliers. Each coin presented in this group should be considered as special, including the crypto-headliners which appeared there. By many different sets of parameters, these headliners are actual outliers.
Part III: Features Extraction and Clustering
A new instance of Extractor class can be initiated by passing a periods_of_interest argument, which will slice the scope of coin by time axis with respect to periods of interest. This parameter is helpful with bull_bear_periods function execution. To ascertain market states, it is necessary to obtain periods of different market states in accordance with bullish, bearish or stable BTC trends. To perform clustering with the basic features extractor, it was first necessary to perform top-level clustering that would separate coins to groups in correspondence with the presence of data.
The result revealed three separate market states with corresponding time periods, which meant it was necessary to create three sets in accordance with states and perform clustering for each of them. HDBSCAN was used to obtain the clusters, and clustering in bearish periods appeared. The combination of using algorithm and passed parameters indeed allowed us to identify one brief bearish period (early 2018 following the steep fall in BTC). However, the aim is to make bull_bear_periods less rigorous. Clustering also appeared in bullish periods, with more clustering appearing in bullish periods. In stable periods (sideways trading), clustering also appeared. This was immensely interesting for the following reasons:
1. Quite a few stable periods were detected on the BTC timeline, making this part of the sample by far the most representative.
2. In contrast to bullish and bearish BTC states, the stable or sideways trading state can be postulated intuitively. The terms bullish and bearish are blurred and there no strong definitions of them, because they are not really understood as such until years later, looking back—but everyone can see when BTC is in a flat or sideways trading phase. In this context, the stable trading market offers the most predictive value.
Part IV: Features Extraction and Clustering
Comparing clusters obtained via various methods can be illuminative and offer insights into possible improvements of created pipeline and further research. There are several reasons why comparing clusters obtained by different methods (i.e., basic clustering and tsfresh clustering) is unsuitable. The main reasons are:
1. The metrics used by each approach are different.
2. There were no top-level clusters in the case of tsfresh.
In the case of tsfresh-based clustering we faced a problem of high dimensionality that did not permit a view of coins as points in space. Nonetheless we could create a simplified visualization of clusters as a graph, which consisted of connected subgraphs. Each subgraph represented corresponding clusters with all nodes mutually connected. These visualizations also revealed important findings:
· All coins inside clusters are connected a priori between each other. This means that special attention should be given to coins lying on the border of dense regions and their connections.
· Graphs do not take into account real distance between coins, but only illustrate which coins are connected, though not how they are connected.
· While one particular method may consider a coin an outlier, another method can find it in a cluster with others at the same time. This will lead to the presence of corresponding nodes on the graph.
3. Conclusion
Fundamentals cannot explain crypto price dynamics. Traditional fundamental clustering does not explain price movement. Therefore, we must reject the hypothesis that “Because they are currency coins we consider them to behave similarly†and we must reject trading based on fundamental clusters. As the study has shown the method used to obtain clusters will lead to significant deviations. The trading implications of this study are that clustering methods can be incorporated into an actual trading strategy, but more information is needed to determine the appropriate method. Market sluggishness and sensitivity could enable more prolonged measures to be conducted. Using HDBSCAN to study metrics and distances should also be explored.
References
Quintero, S. (2018). Clustering cryptocurrencies with affinity propagation. Retrieved
from https://towardsdatascience.com/clustering-cryptocurrencies-with-affinity-propagation-b0ca9e105bb6
Get Professional Assignment Help Cheaply
Are you busy and do not have time to handle your assignment? Are you scared that your paper will not make the grade? Do you have responsibilities that may hinder you from turning in your assignment on time? Are you tired and can barely handle your assignment? Are your grades inconsistent?
Whichever your reason is, it is valid! You can get professional academic help from our service at affordable rates. We have a team of professional academic writers who can handle all your assignments.
Why Choose Our Academic Writing Service?
- Plagiarism free papers
- Timely delivery
- Any deadline
- Skilled, Experienced Native English Writers
- Subject-relevant academic writer
- Adherence to paper instructions
- Ability to tackle bulk assignments
- Reasonable prices
- 24/7 Customer Support
- Get superb grades consistently
Online Academic Help With Different Subjects
Literature
Students barely have time to read. We got you! Have your literature essay or book review written without having the hassle of reading the book. You can get your literature paper custom-written for you by our literature specialists.
Finance
Do you struggle with finance? No need to torture yourself if finance is not your cup of tea. You can order your finance paper from our academic writing service and get 100% original work from competent finance experts.
Computer science
Computer science is a tough subject. Fortunately, our computer science experts are up to the match. No need to stress and have sleepless nights. Our academic writers will tackle all your computer science assignments and deliver them on time. Let us handle all your python, java, ruby, JavaScript, php , C+ assignments!
Psychology
While psychology may be an interesting subject, you may lack sufficient time to handle your assignments. Don’t despair; by using our academic writing service, you can be assured of perfect grades. Moreover, your grades will be consistent.
Engineering
Engineering is quite a demanding subject. Students face a lot of pressure and barely have enough time to do what they love to do. Our academic writing service got you covered! Our engineering specialists follow the paper instructions and ensure timely delivery of the paper.
Nursing
In the nursing course, you may have difficulties with literature reviews, annotated bibliographies, critical essays, and other assignments. Our nursing assignment writers will offer you professional nursing paper help at low prices.
Sociology
Truth be told, sociology papers can be quite exhausting. Our academic writing service relieves you of fatigue, pressure, and stress. You can relax and have peace of mind as our academic writers handle your sociology assignment.
Business
We take pride in having some of the best business writers in the industry. Our business writers have a lot of experience in the field. They are reliable, and you can be assured of a high-grade paper. They are able to handle business papers of any subject, length, deadline, and difficulty!
Statistics
We boast of having some of the most experienced statistics experts in the industry. Our statistics experts have diverse skills, expertise, and knowledge to handle any kind of assignment. They have access to all kinds of software to get your assignment done.
Law
Writing a law essay may prove to be an insurmountable obstacle, especially when you need to know the peculiarities of the legislative framework. Take advantage of our top-notch law specialists and get superb grades and 100% satisfaction.
What discipline/subjects do you deal in?
We have highlighted some of the most popular subjects we handle above. Those are just a tip of the iceberg. We deal in all academic disciplines since our writers are as diverse. They have been drawn from across all disciplines, and orders are assigned to those writers believed to be the best in the field. In a nutshell, there is no task we cannot handle; all you need to do is place your order with us. As long as your instructions are clear, just trust we shall deliver irrespective of the discipline.
Are your writers competent enough to handle my paper?
Our essay writers are graduates with bachelor's, masters, Ph.D., and doctorate degrees in various subjects. The minimum requirement to be an essay writer with our essay writing service is to have a college degree. All our academic writers have a minimum of two years of academic writing. We have a stringent recruitment process to ensure that we get only the most competent essay writers in the industry. We also ensure that the writers are handsomely compensated for their value. The majority of our writers are native English speakers. As such, the fluency of language and grammar is impeccable.
What if I don’t like the paper?
There is a very low likelihood that you won’t like the paper.
Reasons being:
- When assigning your order, we match the paper’s discipline with the writer’s field/specialization. Since all our writers are graduates, we match the paper’s subject with the field the writer studied. For instance, if it’s a nursing paper, only a nursing graduate and writer will handle it. Furthermore, all our writers have academic writing experience and top-notch research skills.
- We have a quality assurance that reviews the paper before it gets to you. As such, we ensure that you get a paper that meets the required standard and will most definitely make the grade.
In the event that you don’t like your paper:
- The writer will revise the paper up to your pleasing. You have unlimited revisions. You simply need to highlight what specifically you don’t like about the paper, and the writer will make the amendments. The paper will be revised until you are satisfied. Revisions are free of charge
- We will have a different writer write the paper from scratch.
- Last resort, if the above does not work, we will refund your money.
Will the professor find out I didn’t write the paper myself?
Not at all. All papers are written from scratch. There is no way your tutor or instructor will realize that you did not write the paper yourself. In fact, we recommend using our assignment help services for consistent results.
What if the paper is plagiarized?
We check all papers for plagiarism before we submit them. We use powerful plagiarism checking software such as SafeAssign, LopesWrite, and Turnitin. We also upload the plagiarism report so that you can review it. We understand that plagiarism is academic suicide. We would not take the risk of submitting plagiarized work and jeopardize your academic journey. Furthermore, we do not sell or use prewritten papers, and each paper is written from scratch.
When will I get my paper?
You determine when you get the paper by setting the deadline when placing the order. All papers are delivered within the deadline. We are well aware that we operate in a time-sensitive industry. As such, we have laid out strategies to ensure that the client receives the paper on time and they never miss the deadline. We understand that papers that are submitted late have some points deducted. We do not want you to miss any points due to late submission. We work on beating deadlines by huge margins in order to ensure that you have ample time to review the paper before you submit it.
Will anyone find out that I used your services?
We have a privacy and confidentiality policy that guides our work. We NEVER share any customer information with third parties. Noone will ever know that you used our assignment help services. It’s only between you and us. We are bound by our policies to protect the customer’s identity and information. All your information, such as your names, phone number, email, order information, and so on, are protected. We have robust security systems that ensure that your data is protected. Hacking our systems is close to impossible, and it has never happened.
How our Assignment Help Service Works
1. Place an order
You fill all the paper instructions in the order form. Make sure you include all the helpful materials so that our academic writers can deliver the perfect paper. It will also help to eliminate unnecessary revisions.
2. Pay for the order
Proceed to pay for the paper so that it can be assigned to one of our expert academic writers. The paper subject is matched with the writer’s area of specialization.
3. Track the progress
You communicate with the writer and know about the progress of the paper. The client can ask the writer for drafts of the paper. The client can upload extra material and include additional instructions from the lecturer. Receive a paper.
4. Download the paper
The paper is sent to your email and uploaded to your personal account. You also get a plagiarism report attached to your paper.
PLACE THIS ORDER OR A SIMILAR ORDER WITH US TODAY AND GET A PERFECT SCORE!!!
