Show rating breakdown
Save to My Lists
Unclaimed
Unclaimed

Top Rated Apache Sqoop Alternatives

Apache Sqoop Reviews & Product Details

Saurav M.
SM
Big Data Developer
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Review source: G2 invite
Incentivized Review
What do you like best about Apache Sqoop?

The simplicity and effectiveness of the application wins my heart. Also, user-centric design is awesome. I am using Sqoop to import data from external datastores into Hadoop Distributed File System or related Hadoop eco-systems like Hive and HBase. The best benefit is how easy it is to use and how fast it is. Sqoop can easily integrate with Hadoop and dump structured data from relational databases on HDFS, complimenting the power of Hadoop. This is why Big Data and Hadoop certification mandates a sound knowledge of Apache Sqoop and Flume. Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

I did not really find anything that I didn't like but if in future I did, I would love to share it. There are few issues using sqoop which buged me initially but are easy to handle:

Sqoop Connector:

Issue:

Use of incorrect connector for the Database to be connected while doing sqoop export or sqoop import.

Missing driver or use of correct driver name of the respective jdbc class for sqoop command.

Missing connection-manager name in some cases of sqoop command.

Incorrect approach of giving password or username of the Database to be connected.

Format of the data stored in HDFS/Hive Tables can create issues. There are few formats such as ORC files which wont allow direct data transfer using sqoop.

Non-matching or incorrect names of columns of source and destination tables where HCatalog is used in sqoop command can show successful sqoop job without the data being transferred. Review collected by and hosted on G2.com.

Recommendations to others considering Apache Sqoop:

Apache Sqoop is designed to efficiently transfer enormous volumes of data between Apache Hadoop and structured datastores such as relational databases. It helps to offload certain tasks, such as ETL processing, from an enterprise data warehouse to Hadoop, for efficient execution at a much lower cost. Sqoop also makes it easy to extract data from Hadoop and export it to external structured datastores. Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

I am using Sqoop to import data from external datastores into Hadoop Distributed File System or related Hadoop eco-systems like Hive and HBase. The best benefit is how easy it is to use and how fast it is. Review collected by and hosted on G2.com.

Apache Sqoop Overview

What is Apache Sqoop?

Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.

Apache Sqoop Details
Show LessShow More
Product Description

Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.


Seller Details
Year Founded
1999
HQ Location
Wakefield, MA
Twitter
@TheASF
66,224 Twitter followers
LinkedIn® Page
www.linkedin.com
2,291 employees on LinkedIn®
Description

Community-led development since 1999. FoundationProjectsPeopleGet InvolvedDownloadSupport ApacheHome. We consider ourselves not simply a group of projects sharing a server, but rather a community of developers and users.

Recent Apache Sqoop Reviews

Shubhashish V.
SV
Shubhashish V.Enterprise (> 1000 emp.)
4.5 out of 5
"Data sqoop from informatica and oracle in Big data applications"
Apache is very helpful in extracting big data set with minimal time .It can be integrated and implemented with many of similar application where b...
Zubin D.
ZD
Zubin D.Enterprise (> 1000 emp.)
5.0 out of 5
"A versatile utility for data move movement and basic sql functions."
The simplicity with which the tool can be used at the get with minimal setup in a distributed environment and the short learning curve.
Verified User
U
Verified UserSmall-Business (50 or fewer emp.)
3.5 out of 5
"Command line interface application for transferring data between database and hadoop"
Data transfer is in parallel,making it fast and cost effective.
Security Badge
This seller hasn't added their security information yet. Let them know that you'd like them to add it.
0 people requested security information

Apache Sqoop Media

Answer a few questions to help the Apache Sqoop community
Have you used Apache Sqoop before?
Yes

Video Reviews

30 out of 31 Total Reviews for Apache Sqoop

4.3 out of 5
The next elements are filters and will change the displayed results once they are selected.
Search reviews
Popular Mentions
The next elements are radio elements and sort the displayed results by the item selected and will update the results displayed.
Hide FiltersMore Filters
The next elements are filters and will change the displayed results once they are selected.
The next elements are filters and will change the displayed results once they are selected.
30 out of 31 Total Reviews for Apache Sqoop
4.3 out of 5
30 out of 31 Total Reviews for Apache Sqoop
4.3 out of 5

Apache Sqoop Pros and Cons

How are these determined?Information
Pros and Cons are compiled from review feedback and grouped into themes to provide an easy-to-understand summary of user reviews.
Pros
Cons

Overall Review Sentiment for Apache SqoopQuestion

Time to Implement
<1 day
>12 months
Return on Investment
<6 months
48+ months
Ease of Setup
0 (Difficult)
10 (Easy)
Log In
Want to see more insights from verified reviewers?
Log in to view review sentiment.
G2 reviews are authentic and verified.
Shubhashish V.
SV
Data Engineer
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Verified Current User
Review source: G2 invite
Incentivized Review
What do you like best about Apache Sqoop?

Apache is very helpful in extracting big data set with minimal time .It can be integrated and implemented with many of similar application where big data is involved with frequency use Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

Sometimes query takes more time in execution when many join or left outer join or other join involved with extra filter in where condition . The failure during partial import happened in long query Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

It is helping in creating big data set from 2 or many table with filter in easy way so that we can use that optimise data to support or develop our applications for business User Review collected by and hosted on G2.com.

Zubin D.
ZD
Associate Software Engineering Manager
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Verified Current User
Review source: G2 invite
Incentivized Review
What do you like best about Apache Sqoop?

The simplicity with which the tool can be used at the get with minimal setup in a distributed environment and the short learning curve. Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

The logging seemed to be something I personally struggled with identifying data anomalies when came to data movement in my use cases. Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

Data movement between a distributed environment and a relation database Review collected by and hosted on G2.com.

Verified User in Accounting
UA
Small-Business(50 or fewer emp.)
More Options
Validated Reviewer
Review source: Organic
What do you like best about Apache Sqoop?

Data transfer is in parallel,making it fast and cost effective. Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

The failure during partial import and export need special handling. Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

It involves transferring the data from a variety of structured sources of data like Oracle,postgree etc. Review collected by and hosted on G2.com.

A P.
AP
Freelance Data science/ big data trainer
Mid-Market(51-1000 emp.)
More Options
Validated Reviewer
Review source: Organic
What do you like best about Apache Sqoop?

Incremental imports are most useful in sqoop Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

sometimes when the same database is used for other business applications and my queries involve multiple joins, performance is impacted Review collected by and hosted on G2.com.

Recommendations to others considering Apache Sqoop:

I recommend Apache sqoop because of its ease of use. Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

Sqoop is solving the problem of easily importing the data updates done at the source database using incremental imports and automating these tasks using sqoop jobs. Review collected by and hosted on G2.com.

Kubendra Reddy M.
KM
Data Engineer
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Review source: G2 invite
Incentivized Review
What do you like best about Apache Sqoop?

Best thing about is it excutes the data transfer parallel.It allows to transfer the data from variety of structured databases. It has huge support community. Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

Under the hood it uses MapReduce which takes time for even small data transfer. Implementing Change data capture and incremental loads is quite complex.It cannot be paused and resumed. Review collected by and hosted on G2.com.

Recommendations to others considering Apache Sqoop:

Standard data transfer tool. Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

We used to import the dtaa from RDBMS to Hadoop cluster and used to export the data from Hadoop to RDBMS. We used this to transfer the data parallely for better performance. Review collected by and hosted on G2.com.

srinu k.
SK
Senior ETL Consultant
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Review source: G2 invite
Incentivized Review
GS
Data Scientist
Mid-Market(51-1000 emp.)
More Options
Validated Reviewer
Review source: G2 invite
Incentivized Review
What do you like best about Apache Sqoop?

The best thing about apache sqoop is it provides easy configuration for getting the data in real time from source system Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

The thing which i disliked about apache sqoop is once the pipleline is broked then it is tough to recover lost messages Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

I have used apache Sqoop for getting the real time data from twitter source apis and then process the data Review collected by and hosted on G2.com.

Nikunj P.
NP
Senior Software Engineer
Computer Software
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Verified Current User
Review source: Seller invite
What do you like best about Apache Sqoop?

The usage is very simple. It's very user friendly. We need not write lot of lines of code to get the data from db or write back to db Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

There is nothing I can see as of now. If we get supporfor swooping nosql dbs that would be great Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

For data warehouse and analytics we are sqooping data from various dbs and making it available in Hadoop for processing and analytics Review collected by and hosted on G2.com.

Verified User in Telecommunications
UT
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Review source: Seller invite
What do you like best about Apache Sqoop?

Replication of Relational DB onto HDFS for MR jobs Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

The data had to be re-imported every time the data was changed Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

We had a huge volume of data in Relational DB. IN order to derive the aggregated KPI, I had to run SQL's . using sqoop , same SQLfrom Relational DB was run on sqoop Review collected by and hosted on G2.com.

Vijay A.
VA
Engineer
Enterprise(> 1000 emp.)
More Options
Validated Reviewer
Review source: G2 invite
Incentivized Review
(Original )Information
What do you like best about Apache Sqoop?

Best suited in our pipeline where we load/unload data from postgreSQL.

Suited for out data format in AVRO.

Under normal circumstances execution is fast and cost effective. Review collected by and hosted on G2.com.

What do you dislike about Apache Sqoop?

When loading huge data it is becoming performance bottle neck for other apps working with same datastore.

There is no pause & resume feature. We have to start job again. Review collected by and hosted on G2.com.

Recommendations to others considering Apache Sqoop:

Getting skilled in sqoop is quite easy and fun. Review collected by and hosted on G2.com.

What problems is Apache Sqoop solving and how is that benefiting you?

In our ETL Pipeline we need to load data from PostgreSQL to HDFS and process the data and load it. It is very fast in execution. Review collected by and hosted on G2.com.