Big Data Frameworks for Efficient Range Queries to Extract Interested Rectangular Sub Regions

A satellite object can span more than one mosaic image. To extract an object from remote sensing satellite images, the mosaic images that contain it need to be stitched. A critical problem is deciding which mosaics to select for stitching from a large mosaic dataset. In this paper, we propose two approaches to overcome the mosaic selection problem …
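The excerpt does not say what the two proposed approaches are, so the following is only a minimal sketch of the underlying selection problem, not the paper's method. It assumes each mosaic is described by an axis-aligned bounding rectangle and uses a plain linear scan to pick the mosaics that overlap a rectangular query region. The names `Rect`, `select_mosaics`, and the tile identifiers are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle; the coordinate system is an assumption."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def intersects(self, other: "Rect") -> bool:
        # Two rectangles overlap when their extents overlap on both axes.
        # Rectangles that only touch along an edge are not counted.
        return (
            self.x_min < other.x_max and other.x_min < self.x_max
            and self.y_min < other.y_max and other.y_min < self.y_max
        )


def select_mosaics(mosaics: dict, query: Rect) -> list:
    """Return the sorted names of all mosaics that overlap the query region."""
    return sorted(name for name, rect in mosaics.items() if rect.intersects(query))


# A hypothetical 2 x 2 grid of mosaic tiles, each 10 x 10 units.
mosaics = {
    "m00": Rect(0, 0, 10, 10),
    "m01": Rect(10, 0, 20, 10),
    "m10": Rect(0, 10, 10, 20),
    "m11": Rect(10, 10, 20, 20),
}

# A query region that straddles the boundary between m00 and m01.
query = Rect(8, 2, 12, 6)
selected = select_mosaics(mosaics, query)
print(selected)
```

A linear scan checks every mosaic for every query, which is the cost that a distributed or indexed range-query approach would aim to reduce on a large mosaic dataset.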

An Approach For Stitching Satellite Images in a Bigdata Mapreduce Framework

In this study, we present a two-step map/reduce framework to stitch satellite mosaic images. The proposed system enables recognition and extraction of objects whose parts fall in separate satellite mosaic images. However, this is a time- and resource-consuming process. The major aim of the study is to improve the performance of the image stitching processes …

A Study for Adaptation of Image Stitching to Big Data Frameworks

In this study, we adapt the image stitching process to big data frameworks. To do so, we present an algorithm that merges two large images in accordance with Hadoop’s map/reduce computation paradigm. Images are first converted to bitmaps, which are represented as matrices of 0s and 1s. The algorithm then finds the best possible match between two …
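The excerpt is cut off before the matching step is described, so the following is only an illustrative single-machine sketch, not the study's algorithm and not a map/reduce job. It assumes grayscale pixels are thresholded into 0s and 1s, and that the "best match" is the column overlap at which the right edge of one bitmap agrees most with the left edge of the other. The function names `to_bitmap`, `best_overlap`, and `stitch`, and the threshold of 128, are assumptions.

```python
def to_bitmap(gray, threshold=128):
    """Threshold a grayscale matrix into a matrix of 0s and 1s."""
    return [[1 if px >= threshold else 0 for px in row] for row in gray]


def best_overlap(left, right):
    """Find the overlap width where left's right edge best matches right's left edge.

    Returns (width, score), where score is the fraction of agreeing cells.
    Both bitmaps are assumed to have the same number of rows.
    """
    height = len(left)
    left_width, right_width = len(left[0]), len(right[0])
    best_width, best_score = 0, -1.0
    for width in range(1, min(left_width, right_width) + 1):
        matches = sum(
            left[r][left_width - width + c] == right[r][c]
            for r in range(height)
            for c in range(width)
        )
        score = matches / (height * width)
        if score > best_score:  # on ties, the narrower overlap is kept
            best_width, best_score = width, score
    return best_width, best_score


def stitch(left, right, overlap):
    """Merge two bitmaps, dropping the overlapping columns from the right one."""
    return [lrow + rrow[overlap:] for lrow, rrow in zip(left, right)]


# A small bitmap split into two pieces that share two columns.
full = [
    [0, 1, 1, 0, 1, 0, 0, 1],
    [1, 0, 1, 1, 0, 0, 1, 0],
    [0, 0, 1, 0, 1, 1, 0, 1],
]
left = [row[:5] for row in full]
right = [row[3:] for row in full]

width, score = best_overlap(left, right)
merged = stitch(left, right, width)
print(width, score, merged == full)
```

In a map/reduce setting, the per-overlap scoring is the part that could plausibly be distributed, since each candidate width is scored independently. How the study actually partitions the work is not stated in the excerpt.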

Detecting Inconsistencies in Survey Data with Apache Spark

In this project, survey data are analyzed using analysis methods, with the goal of determining whether the results contain any anomalies. What we call anomalous data is any data item or data set that does not fit the model we built from the data previously available to us. In short, these are values that differ from what is expected. The amount of data in the world is growing rapidly day by day. However, these data …

A scalable distributed query framework for unstructured big clinical data: A case study on diabetic records

Abstract. Unstructured data forms close to 80% of information in the healthcare industry and is growing exponentially. Analyzing and querying this type of data is not efficient with traditional relational database technologies. In this paper, we propose a distributed and scalable big data framework for querying and analyzing unstructured clinical data. The framework is …

Analyzing Big Security Logs in Cluster with Apache Spark

Abstract. Cyber security is the major concern in today’s highly networked environment, and logging is the primary way of tracking compliance with the security policies. However, analyzing the massive amount of logs has become a “Big Data” problem. Apache Spark is one of the latest and most notable incarnations of Data Flow Models …