Thursday, July 14, 2011

Big Data with Cloud computing (Amazon Web Service)

As Big Data projects requires huge amount of resources, cloud computing helps to avoid resource maintenance headache, Already discussed more details in previous post. Here discuss more about big data with cloud provider Amazon Web Service

Three major resource requires for any type of computing are Processor (CPU), Memory (RAM), Storage (Hard disk). Amount of each resource requires for projects vary, especially for big data project need more for processing huge amount of data. 

Amazon provides Elastic Compute Cloud (EC2) instance, is similar like a single desktop machine which is cloud, user can compute over network. EC2 are available in different types for user from micro instance to very large instance. user can create any number of EC2 instance. EC2 small instance has memory of 1.7GB, 160GB hard disk and one 32-bit processor. Those who requires more power in single instance can go up-to EC2 Cluster GPU instance, which has 22 GB memory, 33.5 processor(64-bit), 1680GB hard disk.


For big data project, resources need may increase at any time, some time data size will grow, or complexity of process and analysis will increase. For this type of situation, Amazon provide Amazon Cloud Watch which keeps monitoring activity of our resource usage in cloud, when demand arise, user can use Amazon Auto scaling feature to increase the capacity of instance by writing some condition in cloud watch. and decrease it back when demands dulls. Using Amazon Auto scaling service, user can also increase number of instances. In our Big data term ‘user can also increase number of nodes when demand raised’. Auto Scaling real time example - click here (check last paragraph)

Lot of cloud providers offers EC2 type of instance, but the main reason i write about Amazon (AWS)  is, Amazon also provider Amazon Elastic MapReduce instance which pre-configured with hadoop framework. So with Amazon Elastic MapReduce user can create a cluster with any number of nodes in few minutes. This hadoop cluster is well performance tuned, user just need to select type of instance(node) and number of node requires, everything else will be taken care by Amazon. This one answer ‘how to do’ for our previous post question ‘Why should we use Cloud Computing for Big data’ 


Amazon S3 (Simple Storage service) provides storage space to store user data. user can store unlimited amount of data in S3 it will be persist even our instance goes down. For processing huge amount of data, its recommended to store all data in S3 and can refer directly from Amazon Elastic MapReduce to process, and also can store processed result data in S3. It is easy to inter-operate between different amazon web services.

48 comments:

  1. Cloud computing system is based on interface software is opened via web browser. Your data remains secure in cloud and you can connect to your cloud from any PC which has internet connection.

    ReplyDelete
  2. Nice explanation on Amazon EC2, very easy to understand :)

    ReplyDelete
  3. Your content is most fortunately cherished. Thank you very much for providing a ocean of useful post.
    cloud backup

    ReplyDelete
  4. Thanks for this sharing.Migrate your existing application into the Cloud quickly and easily.We are using cloud computing environment in our technology.

    ReplyDelete
  5. Your content is useful for big file storage system in cloud Computing.

    ReplyDelete
  6. Nice Post.
    informative and interesting information is here.
    Thanks for sharing with us. I am looking another great post.
    Cloud computing

    ReplyDelete
  7. Outstanding blog thanks for sharing such wonderful blog with us ,after long time came across such knowledgble blog. keep sharing such informative blog with us.
    Air Hostess Training in Chennai | Air Hostess Training Institute in Chennai | Air Hostess Academy in Chennai | Air Hostess Course in Chennai | Air Hostess Institute in Chennai

    ReplyDelete
  8. Thanks for sharing,this blog makes me to learn new thinks.
    interesting to read and understand.keep updating it.
    AWS Training center in Bangalore
    AWS Training in Ambattur
    AWS Training in Saidapet

    ReplyDelete
  9. I am obliged to you for sharing this piece of information here and updating us with your resourceful guidance. Hope this might benefit many learners. Keep sharing this gainful articles and continue updating us.
    Ethical Hacking Course in Chennai 
    Hacking Course in Chennai 
    Ethical Hacking Training in Chennai 
    Certified Ethical Hacking Course in Chennai 
    Ethical Hacking Course 

    ReplyDelete
  10. This blog is very much helpful to us. Thanks for your information

    Article submission sites

    Education

    ReplyDelete
  11. Hi, Your post is quite great to view and easy way to grab the extra knowledge. Thank you for your share with us. I like to visit your site again for my future reference.
    Robotic Process Automation Training
    RPA course
    Robotic Process Automation Certification
    RPA Training
    RPA Training Institute in Chennai
    Robotic Process Automation Courses

    ReplyDelete
  12. This comment has been removed by the author.

    ReplyDelete
  13. Thank you for such an amazing post. Keep sharing this kind of useful information.

    Primavera Training in Velachery

    Primavera Courses in Velachery

    Primavera Training in Tambaram

    <a href="https://www.fita.in/primavera-training-in-chennai/”>Primavera Courses in Tambaram</a>

    ReplyDelete
  14. I am obliged to you for sharing this piece of information here and updating us with your resourceful guidance. Hope this might benefit many learners. Keep sharing this gainful articles and continue updating us.
    Salesforce Training in Chennai
    Salesforce Training
    Salesforce Training institutes in Chennai
    Salesforce Course in Chennai
    Salesforce developer training in chennai
    Salesforce Course

    ReplyDelete
  15. Awesome post. Really you are shared very informative concept... Thank you for sharing. Keep on
    updating...

    imedicalassistants
    Technology

    ReplyDelete
  16. I am obliged to you for sharing this piece of information here and updating us with your resourceful guidance. Hope this might benefit many learners. Keep sharing this gainful articles and continue updating us.
    Selenium training in chennai
    Selenium training institute in Chennai
    iOS Course Chennai
    Digital Marketing Training in Chennai
    hp loadrunner training
    Loadrunner Training in Velachery

    ReplyDelete
  17. Thanks for the great post on your blog, it really gives me an insight on this topic.I must thank you for this informative read. I hope you will post again soon.
    Cloud Computing Training in Perungudi
    Cloud Computing Training in Saidapet
    Cloud Computing Training in Nolambur
    Cloud computing Training Bangalore

    ReplyDelete
  18. Thank you for your guide to with upgrade information about Hadoop
    Hadoop admin Online Training

    ReplyDelete
  19. This comment has been removed by the author.

    ReplyDelete
  20. This is a good post. This post give truly quality information. I’m definitely going to look into it. Really very useful tips are provided here. thank you so much. Keep up the good works.
    Best Android Training in Chennai
    Android Course in Chennai with placement
    Android Training Center in Chennai
    Aws Certification in Chennai
    Amazon Web Services Training in Chennai
    AWS Training centers in Chennai

    ReplyDelete
  21. Really I enjoy your site with effective and useful information. It is included very nice post with a lot of our resources.thanks for share. i enjoy this post. Shopin

    ReplyDelete
  22. Nice article. I liked very much. All the informations given by you are really helpful for my research. keep on posting your views.
    ccna course in Chennai
    ccna Training in Chennai
    ccna Training institute in Chennai
    ccna institute in Chennai
    Best CCNA Training Institute in Chennai

    ReplyDelete