最近使用了阿里云的OCR文字識別API
先來看看效果
我使用的是通用類文字識別走诞,具體實現(xiàn)過程如下:
1.購買阿里云的通用類文字識別
目前是0元免費的,可以使用500次堡称。購買成功后到->控制臺->云市場查看購買的API章钾,復(fù)制它的APPCODE碼墙贱。
2.根據(jù)官方給出的API文檔提交請求
我使用的Retrofit提交網(wǎng)絡(luò)請求,定義如下的接口:
interface AliService{
@POST("/api/predict/ocr_general")
Call<HttpResult> getText(@Body RequestBody body,@Header("Authorization") String authorization);
}
根據(jù)官方提供的返回json實例贱傀,自定義一個HTTPResult類用于接收數(shù)據(jù)惨撇,記得添加Getter and Setter方法和構(gòu)造方法:
public class HttpResult{
private String request_id;
private List<Bean> ret;
private boolean success;
}
class Bean{
private Rect rect;
private String word;
class Rect{
private float angle;
private float height;
private float left;
private float top;
private float width;
}
}
由于圖片是bitmap格式的,我們必須要將圖片進(jìn)行base64編碼后進(jìn)行請求府寒。
public static String bitmapToBase64(Bitmap bitmap) {
ByteArrayOutputStream bos = new ByteArrayOutputStream();
bitmap.compress(Bitmap.CompressFormat.JPEG, 40, bos);//參數(shù)100表示不壓縮
byte[] bytes = bos.toByteArray();
//轉(zhuǎn)換來的base64碼不需要加前綴魁衙,必須是NO_WRAP參數(shù)报腔,表示沒有空格。
return Base64.encodeToString(bytes, Base64.NO_WRAP);
//轉(zhuǎn)換來的base64碼需要需要加前綴剖淀,必須是NO_WRAP參數(shù)纯蛾,表示沒有空格。
//return "data:image/jpeg;base64," + Base64.encodeToString(bytes, Base64.NO_WRAP);
}
根據(jù)官方文檔里的請求參數(shù)纵隔,構(gòu)建出請求體:
Retrofit retrofit = new Retrofit.Builder()
.baseUrl("https://tysbgpu.market.alicloudapi.com")
.addConverterFactory(GsonConverterFactory.create())
.build();
AliService aliService = retrofit.create(AliService.class);
String body = "{\"image\":\""+bitmapToBase64(bitmap)+"\"," +
"\"configure\":{\"min_size\":16,\"output_prob\":false,\"output_keypoints\":false,\"skip_detection\":false,\"without_predicting_direction\":false}}";
RequestBody requestBody = RequestBody.create(okhttp3.MediaType.parse("application/json;charset=UTF-8"), body);
Call<HttpResult> call = aliService.getText(requestBody, "APPCODE " + APPCODE);
call.enqueue(new Callback<HttpResult>() {
@Override
public void onResponse(Call<HttpResult> call, Response<HttpResult> response) {
//根據(jù)返回的json解析出來并更新UI
if (response.body().getRet()!= null){
List<Bean> beans = response.body().getRet();
for (Bean bean : beans)
text += bean.getWord()+"\n";
activity.runOnUiThread(new Runnable() {
@Override
public void run() {
textView.setText(text);
}
});
}
}
@Override
public void onFailure(Call<HttpResult> call, Throwable t) {
Log.e(TAG, "onFailure: "+t.getMessage());
}
});
以上翻诉,就是調(diào)用阿里云OCR接口的核心代碼了。如果你還不清楚如何調(diào)用相機(jī)拍照并返回圖片的話巨朦,繼續(xù)往下看米丘。
3.Android調(diào)用相機(jī)拍照并返回圖片
① 在清單文件AndroidManifest里面申請權(quán)限。
<uses-permission android:name="android.permission.INTERNET"/>
<uses-permission android:name="android.permission.CAMERA"/>
<uses-permission android:name="android.permission.READ_EXTERNAL_STORAGE"/>
<uses-permission android:name="android.permission.WRITE_EXTERNAL_STORAGE"/>
在application中聲明FileProvide:
<provider
android:authorities="com.briana.aliocr.provider"http://自己的包名
android:name="androidx.core.content.FileProvider"
android:exported="false"
android:grantUriPermissions="true">
<meta-data
android:name="android.support.FILE_PROVIDER_PATHS"
android:resource="@xml/file_paths" />
</provider>
新建一個xml糊啡,命名為file_paths.xml拄查。
<?xml version="1.0" encoding="utf-8"?>
<resources>
<paths>
<external-path
name="camera_photos"
path="." />
<!-- path設(shè)置為'.'時代表整個存儲卡 Environment.getExternalStorageDirectory() + "/path/" -->
</paths>
</resources>
② 在MainActivity中修改如下:
在調(diào)用相機(jī)拍照前,判斷是否擁有權(quán)限棚蓄,沒有權(quán)限堕扶,就去申請。
private static final int PERMISSIONS_REQUEST_CODE = 1;
private boolean hasPermission(){
if (ContextCompat.checkSelfPermission(this, Manifest.permission.WRITE_EXTERNAL_STORAGE) != PackageManager.PERMISSION_GRANTED
|| ContextCompat.checkSelfPermission(this, Manifest.permission.READ_EXTERNAL_STORAGE) != PackageManager.PERMISSION_GRANTED
|| ContextCompat.checkSelfPermission(this,Manifest.permission.CAMERA) != PackageManager.PERMISSION_GRANTED) {
ActivityCompat.requestPermissions(this, new String[]{Manifest.permission.WRITE_EXTERNAL_STORAGE, Manifest.permission.READ_EXTERNAL_STORAGE, Manifest.permission.CAMERA}, PERMISSIONS_REQUEST_CODE);
return false;
}else {
return true;
}
}
重寫onRequestPermissionsResult方法梭依,查看請求權(quán)限結(jié)果是否被用戶通過稍算,如果通過,就調(diào)用takephoto()方法拍照役拴。
@Override
public void onRequestPermissionsResult(int requestCode, @NonNull String[] permissions, @NonNull int[] grantResults) {
super.onRequestPermissionsResult(requestCode, permissions, grantResults);
if (requestCode == PERMISSIONS_REQUEST_CODE) {
if (grantResults.length > 0) {
for (int grantResult : grantResults) {
if (grantResult == PackageManager.PERMISSION_DENIED) {
return;
}
}
takePhoto();
}
}
}
調(diào)用相機(jī)拍照糊探,并將圖片路徑記錄下來:
private static final int CAMERA_REQUEST_CODE = 2;
File mFile;
Uri imageUri;
private void takePhoto(){
if (!hasPermission()) {
return;
}
File path = new File(Environment.getExternalStorageDirectory(),"img");
mFile = new File(path,System.currentTimeMillis()+".jpg");
try {
if (!path.exists())
path.mkdir();
if (!mFile.exists())
mFile.createNewFile();
} catch (IOException e) {
e.printStackTrace();
}
if (Build.VERSION.SDK_INT >= Build.VERSION_CODES.M) {
String authority = getPackageName() + ".provider";
imageUri = FileProvider.getUriForFile(this, authority, mFile);
} else {
imageUri = Uri.fromFile(mFile);
}
Intent intent = new Intent(MediaStore.ACTION_IMAGE_CAPTURE);
intent.putExtra(MediaStore.EXTRA_OUTPUT,imageUri);
startActivityForResult(intent,CAMERA_REQUEST_CODE);
}
重寫onActivityResult方法,根據(jù)路徑取得圖片河闰,顯示在imageView上科平,再調(diào)用阿里的接口進(jìn)行圖片文字識別。
@Override
protected void onActivityResult(int requestCode, int resultCode, @Nullable Intent data) {
super.onActivityResult(requestCode, resultCode, data);
if (requestCode == CAMERA_REQUEST_CODE) {
Bitmap photo = BitmapFactory.decodeFile(mFile.getAbsolutePath());
imageView.setImageBitmap(photo);
AliOcr aliOcr = new AliOcr();
aliOcr.getText(this,photo);
}
}
給按鈕添加點擊事件監(jiān)聽姜性,點擊拍照:
button.setOnClickListener(new View.OnClickListener() {
@Override
public void onClick(View view) {
takePhoto();
}
});
如果對你有幫助的話瞪慧,給個贊吧~