Skip to main content

PIG : Reading data from file


To read the data from a file we can use the LOAD command. Assume there is a file named player.csv (downloaded public dataset of english premier league player from one of the open data set).

Sample Data from player.csv file

Player id,Player,Position,Number,Club,Club (country),D.O.B,Age,Height (cm),Country,Caps,International goals,Plays in home country
336722,Alan PULIDO,Forward,11,Tigres UANL,Mexico,08.03.1991,23,176,Mexico,5,4,TRUE
368902,Adam TAGGART,Forward,9,Newcastle United Jets FC,Australia,02.06.1993,21,172,Australia,4,3,TRUE
362641,Reza GHOOCHANNEJAD,Forward,16,Charlton Athletic FC,England,20.09.1987,26,181,Iran,13,9,FALSE

Pig script to load the data. We must specify the record structure of the file.

grunt> player_data  = LOAD 'players.csv'
       USING PigStorage(',')
       AS
       (player_id:int,
       player:chararray,
       position:chararray,
       number:int,
       club:chararray,
       club_country:chararray,
       d_o_b:chararray,
       age:int,
       height_cm:int,
       country:chararray,
       caps:chararray,
       international_goals:chararray,
       plays_home_country:chararray);

grunt> DUMP player_data;

Sample Output

(380000,Marcelo BROZOVIC,Midfielder,14,GNK Dinamo Zagreb,Croatia,16.11.1992,21,180,Croatia,0,0,TRUE)
(380009,Luis LOPEZ,Goalkeeper,1,Real Espana,Honduras,13.09.1993,20,182,Honduras,0,0,TRUE)
(379910,Adnan JANUZAJ,Midfielder,20,Manchester United FC,England,05.02.1995,19,180,Belgium,0,0,FALSE)



Comments

Popular posts from this blog

C programing : File

/* Program to take lines 1,4,7,10.... from a text file and to write into a new file. It is written in c with platform devcpp*/ /* eg:- input file -> ab.txt contain 1 lady gaga sdrgrg ergerg 2 oraph winfrey dfgdr dfgdf . output file neww.txt 1 lady gaga 2 oraph winfrey . . */ #include #include #include using namespace std; main() { char c; FILE *fp1,*fp2; if(fp1=fopen("e:\\ab.txt","r")) { cout<<"\n opened fp1"; } else { cout<<"\n failed fp1"; } if( fp2=fopen("e:\\neww.txt","w") ) { cout<<"\n opened fp2"; } else { cout<<"\n failed fp2"; } while(!feof(fp1)) { c=getc(fp1); while(c!='\n') { putc(c,fp2); c=getc(fp1); } putc('\n',fp2); c=getc(fp1); while(c!='\n') { c=getc(fp1); } c=getc(fp1); while(c!='\n') { c=getc(fp1); } } cout<<"\n End"; getch(); return 0; }

OpenCV installation with Visual Studio 2010

Hi all.I will show you how to install OpenCV library in windows with Visual Studio 2010 to use with Visual C++ Here I am show to install OpenCV Ver 2.2. 1. Download the OpenCV from source forge     The link is : http://sourceforge.net/projects/opencvlibrary/files/opencv-win/ 2. Install the OpenCV to any drive.I have installed as C:\OpenCV2.2 3. Next open Visual Studio 2010.Select New>Project 4. Select Win32 Console Application from Visual C++ popup. 5. Give the project a name and press OK . I gave as 'helloworld' 6. Click Finish to continue.You can see a new file has opened where you can type your code. 7. Next select Project> <your project name> Properties . 8. Go to VC++ Directories and select Include Directories.There add the two links to that     a) C:\OpenCV2.2\include     b) C:\OpenCV2.2\include\opencv 9. Next Go to Library Directories and add the following link     a) C:\OpenCV2.2\lib 10. Now Go to Linker option from the left

Install sublime editor

Hi. Sublime editor is a light weight editor and very helpful for developers to write code.It will highlight the code in colors for easy readability. Sublime editor 3 is the latest one available as of now . The below link shows how to install sublime editor in CentOS . http://software-engineer.gatsbylee.com/how-to-install-sublime-3-on-centos-7-rhel-7/ To know about you system OS is 32 bit or 64 bit , use the below command. >uname -a