This Final Year Project proposes an Intelligent Big Data Management and Study Console (IBDMSC). The idea is to provide an integrated application that helps in migration, management, monitoring and archiving local, cloud and multi-cloud big-data repositories. This application will demonstrate the mig
Intelligent Big Data Management and Study Console
This Final Year Project proposes an Intelligent Big Data Management and Study Console (IBDMSC). The idea is to provide an integrated application that helps in migration, management, monitoring and archiving local, cloud and multi-cloud big-data repositories. This application will demonstrate the migration process with big-data sets to migrate from one-host to another host, along with validation, logging and archiving. This application also enables time estimates and storage analysis for the migration process. The project proposed a layered model for the application, testing and deployment to local, cloud and multi-cloud platform and benchmarking the migration process for some standard big-datasets in such an environment.
Many factors should be considered before initiating a data migration process like type of data, infrastructure of the local server & third-party storage and etc. Failure during the process can cause a big loss to the organization. Therefore this application provides a solution to all these problems. It aims to provide a secure data transfer between local server to cloud and cloud to cloud.
Data processing has become a new domain in computer science. Researchers are working on optimizing the data storage, minimizing the computation time and securing data from being stolen or deleted. Data processing includes different processes like data filtering, data cleaning, data manipulation and data migration etc. With the evolving technologies businesses are being transformed and generating an enormous amount of data which is an important asset for organizations. Standards and technology evolve continuously, so do business requirements and often they are the main drivers for migration. Such projects are not only costly and resource-intensive, but also require meticulous planning, right tools, and intensive testing to succeed. Migration of data is a process of moving data from one system to another i.e. from the legacy source systems into a new system, known as the target system or the application. The combination of the occurrence of data migration processes and the resources consumed in a data migration project results in significant amount of the IT budget for any company. 84% of the data migration projects fail to meet the expectations. These failures occur due to the difference of infrastructure between the organization storage and cloud, structure of data and etc.
This Project offers a complete data migration solution from cloud to cloud or cloud to Local Server and vice versa. It offers the data migration service on different clouds i.e. Microsoft Azure, Amazon AWS, Ali Baba, and Openstack (locally hosted), where user can transfer files or big data sets from one cloud to another without having the information or knowledge about the implementation of the transfer protocols. Application is designed in a layered architecture in which different layers are working concurrent to each other having different functionalities.
Project was implemented using agile methodology. Since it was a research and product based project, research was carried out about Big Data, Clouds and technologies supporting reliable data migration following the research implementation of the project was started. Data sets like Million Song Data and etc. were collected for the testing of successful file transfer through the system with all the layers i.e. communication, migration, monitoring and logging working suitably. Other than this research on different cloud infrastructures and their APIs/ services was also carried out to have a clear image of how these clouds works and which cloud is better in different scenarios.
Project mainly focused on transfer of three types of data i.e unstructured (file), semi-structured (NoSQL) and structured (Relational Data). To easy understand the scenario the project was divided into cases. First case was the transfer of data from local machine to cloud. Second was the transfer of data from one cloud to another. Third was the transferring of data from locally hosted cloud (Openstack) to cloud. These cases were further divided into transferring a particular form of data that is either file structure, RDBMS or NoSQL. Each cloud had a different implementation of each data format since every cloud has a different service for it.
Data transfer was implemented on different clouds. Accounts of multiple clouds were first created on trial version or student accounts but due to limited access to the services and issues in receiving the credentials, accounts were switched to paid accounts to have full access to the services. Services included Object Storages, Relational Databases, and NoSQL Databases. Services were charged on the basic of instances created, which were needed every time data was to be transferred. Charges were also raised by the duration of service used.

| Services | Charges |
| SQL Database | $2.0175/hour (Basic) |
| Cosmos DB | $24/month(minimum)+ Instance Creation Charges $6 |
| Blob Storage | $17.9/Month |
| API Access Charges | $3.83/hour |
| Services | Charges |
| Apsara DB | (US$0.19/GB/Month Subscription +US$0.156/GB Internet Traffic +US$0.012 Monitor ) for one Instance |
| Table Store | ( US$0.00030/GB Data Storage + US$0.123/GB Internet traffic ) Per Hour for One Instance |
| Object Storage | US$0.0200/GB/Month |
| API Access Charges | Around $0.015 |
| Services | Charges |
| Amazon RDS | 1 Instance Running $26.33/Hour |
| Dynamo DB | $48.78 /Month |
| AWS S3 | $ 2.43 /Month |
| API Access | $0.044/Hour |
Project Involves creation and deletion of multiple instances for testing purpose .If we use the above mentioned resources with minimum cost to bear, Project Estimate Budget Required will be around Rs. 90,000.
Note: We've considered our services cost as equipment cost. Project equipment item details restricting us to write actual costs.
Services
SQL Database
Cosmos DB
Blob Storage
API Access Charges
Services
Apsara DB
Table Store
Object Storage
API Access Charges
Services
Amazon RDS
Dynamo DB
AWS S3
API Access
Final Deliverables of the project includes the working web application. Web application will be capable of effectively transferring of data along with monitoring and logging of that data.
| Services | Charges |
| Apsara DB | (US$0.19/GB/Month Subscription +US$0.156/GB Internet Traffic +US$0.012 Monitor ) for one Instance |
| Table Store | ( US$0.00030/GB Data Storage + US$0.123/GB Internet traffic ) Per Hour for One Instance |
| Object Storage | US$0.0200/GB/Month |
| API Access Charges | Around $0.015 |
We are going to implement IOT based smart parking system. This main proposes is to...
The machine we are proposing here will be used for the offices, shopping malls & resta...
In this project, a simple energy meter can be turned into a di...
In greenhouses, monitoring and controlling of many parameters are important for the good q...
DoTask App is for all the platforms like Android, IOS , Desktop and for the web. App is a...